
Given every other iteration has basically just been the same thing but bigger, why should we think this?



My point was to caution against being too confident about the underlying architecture, not to argue for any particular alternative.

Your statement is false: a lot changed under the hood between GPT-4 and o1, and notably not through a larger model size. In fact, the model size of o1 is smaller than GPT-4's by several orders of magnitude! Improvements are being made in other ways.



