I dunno about that in this case. The "confidently incorrect" problem seems inherent to the underlying algorithm to me. If it were solved, I suspect that would be a paradigm shift of the sort that happens on a timescale of years, at best.
Yes, the "confidently incorrect" issue will be a tough nut to crack for the current spate of generative text models. LLMs have no ability to analyze a body of text and determine anything about it (e.g., how likely it is to be true); they are clever, but at bottom they can only extrapolate from patterns found in the training data. If no one has said anything like "X, and I'm 78% certain about it", it's tough to imagine how an LLM could generate reasonably accurate probability estimates for its own claims.
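To make the distinction concrete, here's a minimal sketch (assuming the Hugging Face transformers library and GPT-2, neither of which is mentioned above) of the only kind of "probability" a plain causal LM natively produces: per-token sequence likelihoods. Those numbers say how plausible the wording is under the training distribution, not how likely the claim is to be true.

```python
# Sketch: per-token log-probabilities from a causal LM (GPT-2 via the
# Hugging Face transformers library). The score measures how plausible the
# *word sequence* is under the training distribution, not whether the
# statement is factually correct.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

text = "The Eiffel Tower is located in Berlin."
input_ids = tokenizer(text, return_tensors="pt").input_ids

with torch.no_grad():
    logits = model(input_ids).logits  # shape: (1, seq_len, vocab_size)

# Log-probability of each token given the tokens before it (shift by one).
log_probs = torch.log_softmax(logits[:, :-1, :], dim=-1)
token_log_probs = log_probs.gather(
    dim=-1, index=input_ids[:, 1:].unsqueeze(-1)
).squeeze(-1)

print("sequence log-prob:", token_log_probs.sum().item())
# A fluent false sentence can score higher than an awkwardly phrased true
# one, which is why these scores are not calibrated truth estimates.
```

A fluency score like this is the raw material the model works with; turning it into a trustworthy "I'm 78% sure" would require something beyond pattern extrapolation.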