Hacker News new | past | comments | ask | show | jobs | submit login

An OCR will always mix up characters so I don’t really see the issue here?



Nope. Most compression does not mix up characters the way JBIG2 does (see the article), and most OCR does not substitute plausible text in for text it fails to scan.

Let's say the text is "The laptop costs $1,000 (one thousand dollars)." but the image is blurry.

Normal compression will give you an image where "$1,000" is blurry. JBIG2 can give you an image where "$1,000" has been replaced by a perfectly-clear "$7,000."

Normal OCR will give you some nonsense like "The laptop costs $7,000 (one 1housand dollars)". The LLM can "fix this up" to something more plausible like "The laptop costs $2,000 (two thousand dollars)."




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: