Hacker News

I am sorry, but if this impresses you, you are a rube. If this were a machine with the smallest bit of actual intelligence, it would, upon seeing it's a chess puzzle, remember "hey, i am a COMPUTER and a small set of fixed moves should take me about 300ms or so to fully solve out" and then do that. If the machine _literally has to cheat to solve the puzzle_ then we have made technology that is, in fact, less capable than what we created in the past.

"Well, it's not a chess engine so it's impressive it-" No. Stop. At best what we have here is an extremely computationally expensive way to just google a problem. We've been googling things since I was literally a child. We've had voice search with Google for, idk, a decade+. A computer that can't even solve its own chess problems is an expensive regression.






> "hey, i am a COMPUTER and a small set of fixed moves should take me about 300ms or so to fully solve out"

from the article:

"3. Attempt to Use Python

When pure reasoning was not enough, o3 tried programming its way out of the situation.

“I should probably check using something like a chess engine to confirm.” (tries to import chess module, but fails: “ModuleNotFoundError”).

It wanted to run a simulation, but of course, it had no real chess engine installed."

this strategy failed, but if OpenAI were to add "pip install python-chess" to the environment, it very well might have worked. in any case, the machine did exactly the thing you claim it should have done.
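for what it's worth, the search the model reached for is only a few lines with python-chess (the `chess` module it tried to import). a minimal sketch, assuming the environment had "pip install python-chess"; the back-rank position in the FEN is my own example, not the puzzle from the article:

```python
# Exhaustive search for a forced mate, using python-chess
# (the module o3 tried and failed to import).
import chess

def forced_mate(board, n):
    """Return a move forcing checkmate within n moves of the side
    to move, or None if no such move exists."""
    for move in board.legal_moves:
        board.push(move)
        if board.is_checkmate():
            board.pop()
            return move
        if n > 1 and not board.is_game_over():
            # Mate is forced only if every opponent reply still loses.
            replies = list(board.legal_moves)
            if all(mated_after(board, r, n - 1) for r in replies):
                board.pop()
                return move
        board.pop()
    return None

def mated_after(board, reply, n):
    """True if, after the opponent plays `reply`, mate is still forced in n."""
    board.push(reply)
    result = forced_mate(board, n)
    board.pop()
    return result is not None

# Back-rank mate in one: White plays Ra8#.
board = chess.Board("6k1/5ppp/8/8/8/8/8/R5K1 w - - 0 1")
print(forced_mate(board, 1))  # a1a8
```

this is pure brute force, no evaluation function; for a small fixed puzzle that's exactly the ~300ms of work the parent comment describes.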

possibly scrolling down to read the full article makes you a rube though.


A computer program that has the agency to google a problem, interpret the results, and respond to a human was science fiction just 10 years ago. The entire field of natural language processing has been solved and it's insane.

OpenAI's whole business is impressing you with whiz-bang sci-fi sound and fury.

This is a bad thing because it means they gave up on solving actual problems and entered the snake oil business.


Honestly, I think that if in 2020 you had asked me whether we would be able to do this in 2025, I would've guessed no, with a fairly high confidence. And I was aware of GPT back then.

If you mean write code to exhaustively search the solution space, then it can actually do that quite happily, provided you tell it you will execute the code for it.
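That kind of exhaustive search is mechanically simple. As a toy stand-in for the chess case (the game and numbers here are my own illustration, not from the thread), a complete solution-space search for a Nim variant, stdlib only:

```python
# Brute-force game solve: last player to take a stone wins,
# each turn removes 1-3 stones. Exhaustively searches every line.
from functools import lru_cache

@lru_cache(maxsize=None)
def can_win(stones):
    """True if the player to move can force a win."""
    # Winning iff some move leaves the opponent in a losing position.
    return any(not can_win(stones - take)
               for take in (1, 2, 3) if take <= stones)

def winning_move(stones):
    """Return a winning number of stones to take, or None if lost."""
    for take in (1, 2, 3):
        if take <= stones and not can_win(stones - take):
            return take
    return None

print(winning_move(5))   # 1  (leave a multiple of 4)
print(winning_move(12))  # None: every move loses against best play
```

Same shape as a mate search: recurse over every legal move, memoize, read off the answer.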

Looks to me like it would have simulated the steps using sensible tools but didn’t know it was sandboxed out of using those tools? I think that’s pretty reasonable.

Suppose we removed its ability to google and it resigned itself to the tedium of writing a chess engine to simulate the steps. Is that “better” for you?





