No: https://github.com/0xnurl/gpts-cant-count

sebzim4500 · 2024-12-23T12:43:08 1734957788

I can't reliably multiply four digit numbers in my head either, what's your point?

reshlo · 2024-12-23T12:45:57 1734957957

Nobody said you have to do it in your head.

sebzim4500 · 2024-12-23T13:38:13 1734961093

That's the equivalent to what we are asking the model to do. If you give the model a calculator it will get 100%. If you give it a pen and paper (e.g. let it show it's working) then it will get near 100%.

reshlo · 2024-12-23T19:19:55 1734981595

Citation needed.

sebzim4500 · 2024-12-24T09:44:16 1735033456

Which bit do you need a citation for? I can run the experiment in 10 mins.

reshlo · 2024-12-24T11:12:01 1735038721

> That's the equivalent to what we are asking the model to do.

Why?

What does it mean to give a model a calculator?

What do you mean “let it show its working”? If I ask an LLM to do a calculation, I never said it can’t express the answer to me in long-form text or with intermediate steps.

If I ask a human to do a calculation that they can’t reliably do in their head, they are intelligent enough to know that they should use a pen and paper without needing my preemptive permission.