You need a ton of specialized knowledge to use compute effectively.
If we had infinite memory and infinite compute, we'd just throw every problem of length n at a tensor in R^(n^n).
The issue is that we don't have enough memory in the world to store that tensor for something as trivial as MNIST (and won't until the 2100s). And as you can imagine, n^n grows a bit faster than a plain exponential, so we never will.
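A back-of-envelope sketch of just how far off we are, assuming n = 784 (a flattened 28x28 MNIST image), 4 bytes per float32 parameter, and a rough guess of ~10^23 bytes for all storage on Earth (these figures are assumptions, not sourced):

```python
import math

# n = 784: a flattened 28x28 MNIST image (assumption for illustration).
n = 784

# The fully general tensor in R^(n^n) has n**n entries; work in log10
# because the number itself has thousands of digits.
log10_params = n * math.log10(n)            # log10 of n**n, ~2269
log10_bytes = log10_params + math.log10(4)  # at 4 bytes per float32

# Rough assumed upper bound on total global storage: ~10^23 bytes.
log10_world_storage = 23

print(f"log10(parameter count) ≈ {log10_params:.0f}")
print(f"shortfall: ~10^{log10_bytes - log10_world_storage:.0f}x more bytes than exist")
```

So even under generous assumptions, the gap isn't a few orders of magnitude; it's thousands of orders of magnitude.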
Then how does this invalidate the bitter lesson? It's like saying that if aerodynamics were true, we'd have planes flying like insects by now. But that's simply not how it works at large scales, especially if you want to build something economical.
Because if the bitter lesson were true, no one would be wasting their time with convolutions or attention blocks. You'd just replace them with the general tensor that allows every possible hyper-relation between all points instead.
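To put rough numbers on that trade-off: a sketch comparing parameter counts for a small conv layer, a dense all-pairs layer, and the fully general order-n interaction tensor, on the same MNIST-sized input. The specific layer shapes (a 3x3 kernel, 1 input and 32 output channels) are illustrative assumptions, not any particular architecture:

```python
import math

# Input of n = 784 values (28x28 MNIST), shapes below are assumptions.
n = 784

conv = 3 * 3 * 1 * 32            # 3x3 conv kernel, 1 in / 32 out channels
pairwise = n * n                 # dense layer: one weight per pair of points
log10_full = n * math.log10(n)   # order-n tensor: every hyper-relation at once

print(f"conv:     {conv} params")             # hundreds
print(f"pairwise: {pairwise} params")         # hundreds of thousands
print(f"full:     ~10^{log10_full:.0f} params")  # thousands of digits
```

The specialized structures are exactly what makes the compute budget usable at all; the "general" solution is off the table by construction.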