That's true for vanilla LLMs, but also keep in mind that there are no public details about o3's architecture at the moment. Clearly they are doing something different, given the huge performance jump across a lot of benchmarks, and it may well involve in-context learning.
My point was to caution against being too confident about the underlying architecture, not to argue for any particular alternative.
Your statement is false: things changed a lot between GPT-4 and o1 under the hood, but notably not through a larger model. In fact, o1 is reportedly smaller than GPT-4 by several orders of magnitude! Improvements are being made in other ways.