Introduction:
Finite Field Assembly is a programming language that lets you emulate GPUs on CPUs.
It's a CUDA alternative that uses finite field theory to convert GPU kernels to prime number fields.
Finite Field is the primary data structure: FF-asm is a CUDA alternative designed for computations over finite fields.
Recursive computing support: not cache-aware vectorization, not parallelization, but performing a calculation inside a calculation inside another calculation.
Extension of C89 - runs everywhere gcc is available.
Context: I'm getting my math PhD and I built this language around my area of expertise, Number Theory and Finite Fields.
I guess it's not clear to me why it's even interesting to talk about their LinkedIn or their PhD in the first place. It's not like having or not having a PhD makes the work any more or less true. Wouldn't it be more interesting to discuss the merits of the post? There's really little point in arguing that their LinkedIn has different info than the comment, therefore the submission is invalid.
But suppose I did actually hold that belief for some reason; then it would seem fairly intellectually dishonest to withhold relevant info in my pointed inquisition, wherein I just characterize them as someone lacking mathematical experience at all, let alone from a world-class university. But maybe that's just me!
I think they're pointing it out because stating you're working towards a PhD in something when you've just graduated and don't seem (as far as we can tell) to be enrolled in a PhD program is misleading. Note that the parent isn't the one who brought up the PhD; the author is, presumably to head off the big question marks everyone reading this had about what it is.
It's unclear whether this page is something that could be useful, and deserves attention. The fact that the author is at best making misleading statements is useful in determining whether you should take their claims at face value.
They claim "Finite Field Assembly is a programming language that lets you emulate GPUs on CPUs".
It's not a programming language, it's a handful of C macros, and it doesn't in any way emulate a GPU on the CPU. I'll be honest: I think the author is trying to fake it till they make it. They seem interested in mathematics, but their claims are far beyond what they've demonstrated, and their post history reveals a series of similar submissions. Insofar as they're curious and want to experiment, I think it's reasonable to encourage them, but they're also asking for money and don't seem to be delivering much.
Why would they post the 4th article in a series where the previous ones require you to pay?
> I guess it's not clear to me why it's even interesting to talk about their LinkedIn or their PhD in the first place?
Am I taking crazy pills? I didn't bring it up; the guy himself, at the top of this very thread branch, explicitly wrote that he's a PhD student working on number theory.
> Wouldn't it be more interesting to discuss the merits of the post?
There is no merit, nothing to discuss. I linked the corresponding GitHub below so you can judge for yourself.
Depending on what properties they sold, they certainly could have gotten valuable real-world expertise with finite fields. It's certainly easier to sell them than infinite ones!
This is phrased in a kind of demanding way to an author who has been kind enough to share their novel work with us. Are you sure you spent enough time trying to understand?
It seems that pretty much everybody here is confused by this article. One user even accused it of LLM plagiarism, which is pretty telling in my opinion.
I for one have no clue what anything I read in there is supposed to mean. Emulating a GPU's semantics on a CPU is a topic which I thought I had a decent grasp on, but everything from the stated goals at the top of this article to the example code makes no sense to me.
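If I had to guess at what "convert GPU kernels to prime number fields" could mean, the nearest standard trick I know is Chinese-Remainder-style packing: pick pairwise coprime moduli, store one residue per "lane" in a single integer, and one arithmetic op then acts on all lanes at once. A minimal sketch of that idea, which is purely my reconstruction and not anything the author's macros are confirmed to do:

    #include <stdio.h>
    #include <stdint.h>

    /* Lanes live in Z/3, Z/5, Z/7 (pairwise coprime), so one integer
     * mod 105 encodes all three "SIMD lanes" at once (CRT). */
    static const uint64_t M[3] = {3, 5, 7};
    #define MOD 105u  /* 3*5*7 */

    /* Pack lane values by brute-force search for the CRT representative
     * (fine for a demo; real code would precompute CRT coefficients). */
    static uint64_t crt_pack(const uint64_t v[3]) {
        uint64_t x;
        for (x = 0; x < MOD; x++)
            if (x % M[0] == v[0] && x % M[1] == v[1] && x % M[2] == v[2])
                return x;
        return 0; /* unreachable for valid lane values */
    }

    int main(void) {
        uint64_t a[3] = {1, 2, 3}, b[3] = {2, 2, 2};
        uint64_t c = (crt_pack(a) + crt_pack(b)) % MOD; /* one add, three lanes */
        int i;
        for (i = 0; i < 3; i++)
            printf("lane %d: %llu\n", i, (unsigned long long)(c % M[i]));
        return 0; /* prints 0, 4, 5: i.e. (1+2)%3, (2+2)%5, (3+2)%7 */
    }

Whether that has anything to do with "emulating a GPU" is another question; a few lanes of modular arithmetic is a very long way from CUDA semantics.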
Not to mention the part where adding compressors like this somewhat defeats the purpose of using a simple format like QOI (although at least zstd is faster than gzip, let alone 7zip).
But if we're modifying things like that, then they might as well use Nigel Tao's improved QOIR format and replace the LZ4 compressor it uses with zstd. That's probably faster and likely compresses better than QOI.
So to clarify: my suggested point of comparison was replacing the GP's QOI + 7zip with QOIR + zstd. QOIR already compresses better than QOI before the LZ4 pass, and zstd compresses faster than 7zip and often better. On top of that, you can serve zstd via the Content-Encoding header when streaming data to a browser, so you don't need to grow the JS bundle or whatever if the use case is the web. So that's basically a guaranteed net improvement all around.
Second of all, the speed/compression trade-off with zstd can be tuned a lot. The "half as fast as LZ4" stat is for the fastest setting, but against the proposed comparison point of 7zip, a slower setting with a better compression ratio is likely perfectly fine.
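For scale, the extra zstd stage is a few lines against libzstd. A sketch, assuming the input buffer already holds an encoded QOI/QOIR image (level 19 standing in for the slow "7zip-like" end of the trade-off, the default level 3 for the fast end):

    #include <stdlib.h>
    #include <zstd.h>   /* link with -lzstd */

    /* Compress an already-encoded QOI buffer with zstd. Returns a
     * malloc'd buffer and its size via out_len, or NULL on failure. */
    static void *zstd_pass(const void *qoi_buf, size_t qoi_len,
                           size_t *out_len, int level) {
        size_t cap = ZSTD_compressBound(qoi_len);
        void *dst = malloc(cap);
        if (!dst) return NULL;
        size_t n = ZSTD_compress(dst, cap, qoi_buf, qoi_len, level);
        if (ZSTD_isError(n)) { free(dst); return NULL; }
        *out_len = n;
        return dst;
    }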
I saw something similar in Bit Twiddling Hacks. Out of utter curiosity, when would you need to interleave bits in prod? Is it something a SaaS dev would be doing, or maybe somebody in embedded programming?
To expand on azornathogron's answer: when you are working with 2D data (it generalises to 3D too!), you often want to filter pixels in a rectangular area, commonly for bilinear filtering or some kind of convolution kernel.
If you don't interleave the bits and have large textures (think 4096 pixels wide) where each pixel is 4 bytes, there is a distance of 16 KB between a pixel and the pixel below it.
This is super bad for caches, and especially for the TLB in the MMU, which is usually way smaller than the data caches.
In GPU literature you'll see this called "tiling" (again, as azornathogron said, it's not always pure Morton order). Intel document their tiling layout; here's an older layout doc:
If you're doing low level graphics programming (processing pixel data) or in some other way dealing with 2D raster type data, you might want to work with data in Morton order or with Morton order tiles or something similar. Interleaving the bits of the x & y coordinate values helps to put pixels that are close together in your 2D space also close together in memory, which can help with making best use of caches.
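For the curious, here's the usual magic-number version of the 2D interleave, essentially what Bit Twiddling Hacks lists, assuming 16-bit coordinates:

    #include <stdint.h>

    /* Spread the low 16 bits of v so that bit i lands at bit 2i. */
    static uint32_t part1by1(uint32_t v) {
        v &= 0x0000FFFFu;
        v = (v | (v << 8)) & 0x00FF00FFu;
        v = (v | (v << 4)) & 0x0F0F0F0Fu;
        v = (v | (v << 2)) & 0x33333333u;
        v = (v | (v << 1)) & 0x55555555u;
        return v;
    }

    /* Morton (z-order) index: x in the even bits, y in the odd bits. */
    static uint32_t morton2d(uint32_t x, uint32_t y) {
        return part1by1(x) | (part1by1(y) << 1);
    }

With this, pixels (x, y) and (x, y+1) differ in a low bit of the index instead of being a whole row apart in memory.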
Keccak (and other ciphers that use only bit rotations and bitwise ops) can use bit interleaving to avoid slow 64-bit rotations on 32-bit hardware, replacing one 64-bit rotation with two independent 32-bit rotations on the interleaved words [1, §2.1].
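Concretely, once the 64-bit word is stored as its even-indexed and odd-indexed bit halves, a rotation by n factors into two 32-bit rotations. A sketch of the trick (my paraphrase, not the paper's code):

    #include <stdint.h>

    /* Bit-interleaved 64-bit word: 'even' holds bits 0,2,4,...,
     * 'odd' holds bits 1,3,5,... */
    typedef struct { uint32_t even, odd; } ilword;

    static uint32_t rol32(uint32_t v, unsigned n) {
        return (v << (n & 31)) | (v >> ((32 - n) & 31));
    }

    /* Rotate-left of the represented 64-bit word by n, 0 <= n < 64. */
    static ilword rol64_interleaved(ilword w, unsigned n) {
        ilword r;
        if (n & 1) {            /* odd amounts swap the halves */
            r.even = rol32(w.odd,  (n + 1) / 2);
            r.odd  = rol32(w.even, n / 2);
        } else {
            r.even = rol32(w.even, n / 2);
            r.odd  = rol32(w.odd,  n / 2);
        }
        return r;
    }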
Two-bit values are common in bioinformatics, and I’ve found the ability to efficiently convert between packed arrays of 1- and 2-bit values to be valuable in that ___domain.
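The widening direction there is the same spread used for Morton codes (interleave with zeros), and narrowing is its inverse. A sketch with illustrative names, not any particular library's API:

    #include <stdint.h>

    /* Spread 16 packed 1-bit values into the low bit of 16 2-bit lanes. */
    static uint32_t widen_1to2(uint16_t bits) {
        uint32_t v = bits;
        v = (v | (v << 8)) & 0x00FF00FFu;
        v = (v | (v << 4)) & 0x0F0F0F0Fu;
        v = (v | (v << 2)) & 0x33333333u;
        v = (v | (v << 1)) & 0x55555555u;
        return v;
    }

    /* Inverse: keep the low bit of each 2-bit lane and compact. */
    static uint16_t narrow_2to1(uint32_t lanes) {
        uint32_t v = lanes & 0x55555555u;
        v = (v | (v >> 1)) & 0x33333333u;
        v = (v | (v >> 2)) & 0x0F0F0F0Fu;
        v = (v | (v >> 4)) & 0x00FF00FFu;
        v = (v | (v >> 8)) & 0x0000FFFFu;
        return (uint16_t)v;
    }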