Hacker News new | past | comments | ask | show | jobs | submit login

4o isn’t really comparable to deepseek r1. Use o3-mini-high or o1 if you wanna stay near the state of the art.



I've had a coding project where I actually preferred 4o outputs to DeepSeek R1, though it was a bit of a niche use case (long script to parse DOM output of web pages).

Also they just updated 4o recently, it's even better now. o3-mini-high is solid as well, I try it when 4o fails.

One issue I have with most models is that when they're re-writing my long scripts, they tend to forget to keep a few lines or variables here or there. Makes for some really frustrating debugging. o1 has actually been pretty decent here so far. I'm definitely a bit of a power user, I really try to push the models to do as much as possible regarding long software contexts.


Why not use a tool where it can perform pricision edits rather than rewrite the whole thing? Eg. Windsurf or Cursor




Consider applying for YC's Summer 2025 batch! Applications are open till May 13

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: