> These LLMs are polymaths that can spit out content at a super human rate.
Do you mean in theory or currently? Because currently, LLMs make simple errors (e.g. [1]) and are more capable of spitting out, well, nonsense. I think it's safe to say we're a long way from LLMs producing anything creatively good.
I'll put it this way: you won't be getting The Godfather from LLMs anytime soon, but you can probably get an industrial film with generic music that tells you how to safely handle solvents.
Computers are generally good at doing math, but LLMs generally aren't [2], and that really demonstrates the weakness of this statistical approach. ChatGPT (as one example) doesn't understand what numbers are or how to multiply them. It relies on having seen similar answers to derive a likely one, so it often gets the first and last digits of the answer correct but not the middle. You can't keep scaling the input data until it has seen every possible math question; that's just not practical.
Now multiplying two large numbers is a solvable problem. Counting Rs in strawberry is a solvable problem. But statistical LLMs are going to have a massive long tail of these problems. It's really going to take the next generational change to make progress.
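To make the contrast concrete, both of those "solvable problems" are one-liners in ordinary code. A quick Python sketch, with arbitrary example numbers:

    # Both problems are trivial for conventional code.
    a, b = 748_315, 962_047          # arbitrary large-ish numbers
    print(a * b)                     # exact product; Python ints have arbitrary precision
    print("strawberry".count("r"))   # 3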
Both the "count the Rs in strawberry" and the "multiply two large numbers" things have been solved for over a year now by the tool usage pattern: give an LLM the ability to delegate to a code execution environment for things it's inherently bad at and train it how to identify when to use that option.
I think the point is that playing whack-a-mole is an effective practical strategy to shore up individual weaknesses (or even classes of weaknesses), but that doesn’t get you to general reasoning unless you think intelligence evolved this way. Given the adaptability of intelligence across the animal kingdom to novel environments never seen before, I don’t think that can be anything other than a short-term strategy for AGI.
I think we’re in agreement. It’s going to take a next-generation architecture to address flaws like the strawberry example, where the LLM often can’t even correct its mistake when it’s pointed out.
I still think transformers and LLMs will likely remain as some component within that next-gen architecture, rather than something completely alien.
[1]: https://www.inc.com/kit-eaton/how-many-rs-in-strawberry-this...
[2]: https://www.reachcapital.com/2024/07/16/why-llms-are-bad-at-...