
All that's happening is that it finds 3 as the most common answer in its training set. When you push back, it responds with the next most common answer.
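
A toy sketch of that idea (the probabilities are made up, not real model output): the first reply is just the top-ranked candidate, and once it's rejected, the runner-up wins.

    # Toy illustration; the probabilities are assumptions, not real model output.
    answer_probs = {"3": 0.62, "2": 0.21, "4": 0.11, "5": 0.06}

    # The first reply is just the top-ranked answer...
    first = max(answer_probs, key=answer_probs.get)    # "3"

    # ...and after "that's wrong", the next most common answer wins.
    rest = {a: p for a, p in answer_probs.items() if a != first}
    second = max(rest, key=rest.get)                   # "2"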



But then why does it stick to its guns on other questions but not this one?


I haven't played with this model, but I rarely find that to be the case with Claude or GPT-4. If you say it's incorrect, it will give you another answer instead of insisting its first one was right.


Wait what? You haven’t used 4o and you confidently described how it works?


It's how LLMs work in general.

If you find a case where forceful pushback is sticky, it's either because the primary answer is overwhelmingly more common in the training set than the next best option, or because the training data contains conversations that showed similar stickiness, especially if the structure of the pushback matches what appears in those conversations.
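
A rough sketch of that argument (the probabilities and the discount factor are assumptions, nothing measured): pushback knocks the original answer down, but if that answer dominates strongly enough, it still comes out on top.

    # Rough sketch; probabilities and discount factor are made up for illustration.
    def answer_after_pushback(probs, original, discount=0.5):
        # Pushback discounts the original answer; everything else is unchanged.
        adjusted = {a: (p * discount if a == original else p) for a, p in probs.items()}
        return max(adjusted, key=adjusted.get)

    # A close call flips to the runner-up...
    print(answer_after_pushback({"3": 0.45, "2": 0.35, "4": 0.20}, "3"))  # -> "2"
    # ...but an overwhelmingly common answer stays sticky.
    print(answer_after_pushback({"3": 0.95, "2": 0.03, "4": 0.02}, "3"))  # -> "3"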


Right... except you said:

> If you say it's incorrect, it will give you another answer instead of insisting on correctness.

> When you push it, it responds with the next most common answer.

Which clearly isn't as black and white as you made it seem.


I'll put it another way: behavior like this is extremely rare in my experience. I'm just trying to explain why it's likely happening when someone does encounter it.



