I have distilled models before, I know how it works. They may have used o1 or o3...

eightysixfour 72 days ago | parent | context | favorite | on: GPT-4.5

I have distilled models before, I know how it works. They may have used o1 or o3 to create some of the synthetic data for this one, but they clearly did not try and create any self-reflective reasoning in this model whatsoever.