
How does this compare in terms of speed, quality, and price to sending images to VLMs like GPT-4o or Claude 3.5?



That's far more expensive and time-consuming. Also, I don't think it would do the markdown formatting and other things unless you carefully specified all of that in your prompts. But the cost is going to be 1000x or something crazy, at least as of right now. These new mini models are dirt cheap: you can keep them running non-stop for like $4 per HOUR.



