Hacker News new | past | comments | ask | show | jobs | submit login

Agreed, this should not be used for anything mission critical unless you're going to sit there and carefully review the output by hand (although that is still going to be 100x faster than trying to manually correct the raw OCR output).

Where it's most useful to me personally is when I want to read some old book from the 1800s about the history of the Royal Navy [0] or something like that which is going to look really bad on my Kindle Oasis as a PDF, and the OCR version available from Archive.org is totally unreadable because there are 50 typos on each page. The ability to get a nice Markdown file that I can turn into an epub and read natively is really nice, and now cheap and fast.

[0] https://archive.org/details/royalnavyhistory02clowuoft/page/...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: