Anthropic has said they ran benchmarks where Claude takes a GitHub issue and tries to generate a Git commit that passes the unit/integration tests someone else wrote for the real, shipped feature. You also have things like multimodal image recognition for UIs: you can say "generate code for a UI that looks like such and such" and then verify the result with the model's own multimodal capabilities.
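Roughly, that first eval loop looks like this. A minimal sketch, not their actual harness: the repo path is whatever checkout you're scoring against, and pytest stands in for whatever test runner the project actually uses.

```python
import subprocess

def passes_existing_tests(repo_dir: str, patch: str) -> bool:
    """Apply a model-generated patch, then run the project's own test suite.

    The patch is scored purely on whether the pre-existing tests pass,
    which is the point: the tests were written against the real feature,
    not against the model's output.
    """
    apply = subprocess.run(
        ["git", "apply", "-"],
        cwd=repo_dir,
        input=patch,
        text=True,
        capture_output=True,
    )
    if apply.returncode != 0:
        return False  # patch didn't even apply cleanly

    tests = subprocess.run(
        ["python", "-m", "pytest", "-q"],
        cwd=repo_dir,
        capture_output=True,
    )
    return tests.returncode == 0
```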
Tool use means the model can actually click a button and check that the app transitioned to the next described UI screen, again verified with the multimodal capabilities.
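Something like this, using Playwright to do the clicking. The `screen_matches_description` judge here is hypothetical, standing in for whatever vision-model call you'd make with the screenshot and the written description:

```python
from playwright.sync_api import sync_playwright

def screen_matches_description(png_bytes: bytes, description: str) -> bool:
    # Placeholder: send the screenshot plus the expected-screen description
    # to a multimodal model and have it answer yes/no.
    raise NotImplementedError("wire up your vision-model call here")

def click_and_verify(url: str, selector: str, expected_screen: str) -> bool:
    """Click a button in a real browser, screenshot the result,
    and ask a multimodal model whether the new screen matches."""
    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        page.goto(url)
        page.click(selector)
        shot = page.screenshot()  # returns PNG bytes
        browser.close()
    return screen_matches_description(shot, expected_screen)
```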
There's still an important point to what I said, even in ML. In fact, consider: if what I said is true, ask what that would mean for how the current status quo goes about demonstrating things. Then think about AI safety lol