I was having a look at the DeepSeek-R1 technical report and found the "aha moment" claims quite smelly, given that they do not disclose whether the base model's training data contains any chain-of-thought or reasoning data.
However, we know the base model is DeepSeek V3, and the DeepSeek V3 technical report says the following in Section 5.1, Supervised Fine-Tuning:
> Reasoning Data. For reasoning-related datasets, including those focused on mathematics, code competition problems, and logic puzzles, we generate the data by leveraging an internal DeepSeek-R1 model. Specifically, while the R1-generated data demonstrates strong accuracy, it suffers from issues such as overthinking, poor formatting, and excessive length. Our objective is to balance the high accuracy of R1-generated reasoning data and the clarity and conciseness of regularly formatted reasoning data.
In 5.4.1 they also describe an ablation experiment in which they train without the "internal DeepSeek-R1" generated data.
While the "internal DeepSeek-R1" model is not explained, I would assume it is a DeepSeek V2 or V2.5 tuned for chain-of-thought reasoning. Therefore, it seems to me the "aha moment" is just promoting behaviour that was already present in V3.
In the "Self-evolution Process of DeepSeek-R1-Zero"/ Figure 3 they claim reinforcement learning also leads to the model generating longer CoT sequences, but again, this comes from V3, they even mention the fine tuning with "internal R1" led to "excessive length".
None of the blogpost, news, articles I have read explaining or commenting on DeepSeek R1 takes this into account. The community is scrambling to re-implement the pipeline (see open-r1).
At this point, I feel like I took a crazy pill. Am I interpreting this completely wrong? Can someone shed some light on this?
I'm also very skeptical of the significance of this "aha moment". Even if they didn't include chain-of-thought data in the base model's training set (unlikely), there is still plenty of it on the modern Internet. OpenAI released 800k reasoning steps which are publicly available, plus GitHub repositories, examples in CoT papers... It's definitely not a novel concept that the model somehow discovered on its own.