>I'm incredibly surprised no one mentions this If you don't see anyone mentionin...

Workaccount2 · 2025-04-07T14:12:19 1744035139

On top of that, what the model prints out in the CoT window is not necessarily what the model is actually thinking. Anthropic just showed this in their paper from last week where they got models to cheat at a question by "accidentally" slipping them the answer, and the CoT had no mention of answer being slipped to them.