Vision transformers are good enough that you can use them alone even on cursive ...

Sidneys1 · 2024-08-09T21:00:55 1723237255

I'd like to hear more about this! I keep coming back to trying to OCR my journals, but nothing I've tried so far works well (enough) on handwriting.

katzinsky · 2024-08-09T21:46:02 1723239962

A couple of other people in the thread are apparently using them too. They're the Microsoft TrOCR models. You do need a moderate amount of software to deskew, process, and segment the image before handing it to the model, but after that it's typically extremely accurate in my experience.

Setting up my software online and monetizing it is next in the queue after my current side project, although I haven't checked the model licenses yet.
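
To illustrate the "segment the image" step mentioned above, here is a minimal sketch of one common approach, horizontal projection-profile line splitting. It is not katzinsky's actual pipeline. It assumes the page has already been deskewed and binarized into rows of 0/1 pixels (1 = ink), and the names `segment_lines`, `crop_lines`, `min_ink`, and `min_height` are hypothetical. As far as I know, the released TrOCR checkpoints expect single text-line images, which is why the page gets cut into lines first.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed format: rows of pixels, 1 = ink, 0 = background


def segment_lines(image: Image, min_ink: int = 1, min_height: int = 1) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges, bottom exclusive, one per text line.

    A row belongs to a line if it contains at least `min_ink` ink pixels.
    Runs shorter than `min_height` rows are dropped as noise.
    """
    # Horizontal projection profile: amount of ink in each row.
    profile = [sum(row) for row in image]

    lines: List[Tuple[int, int]] = []
    start = None  # top row of the line currently being scanned, if any
    for y, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = y  # entering a text line
        elif ink < min_ink and start is not None:
            if y - start >= min_height:
                lines.append((start, y))  # leaving a text line
            start = None
    # Close a line that runs to the bottom edge of the page.
    if start is not None and len(profile) - start >= min_height:
        lines.append((start, len(profile)))
    return lines


def crop_lines(image: Image, lines: List[Tuple[int, int]]) -> List[Image]:
    """Cut the page into one sub-image per detected line."""
    return [image[top:bottom] for top, bottom in lines]


# Toy "page" with two lines of ink separated by blank rows.
page = [
    [0, 0, 0, 0, 0, 0],
    [0, 1, 1, 0, 1, 0],
    [0, 1, 0, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 0, 0],
    [0, 1, 1, 1, 0, 0],
    [0, 1, 0, 0, 0, 0],
]
line_ranges = segment_lines(page)
line_images = crop_lines(page, line_ranges)
```

Each entry of `line_images` would then be passed to the recognition model on its own. Real handwriting usually needs more than this (cursive ascenders and descenders can overlap between lines), so treat it as a starting point only.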

mewpmewp2 · 2024-08-10T12:32:33 1723293153

Have you tried uploading an image of your handwriting to the ChatGPT interface with GPT-4o?

If so, what were the results? If not, could you try it and let us know how it goes?

Sidneys1 · 2024-08-10T16:57:29 1723309049

Not with 4o, but I tried it with 4 (through Copilot) a while ago and the results were abysmal, even with very neatly printed handwriting.

mewpmewp2 · 2024-08-11T12:25:45 1723379145

Try again with 4o through the ChatGPT interface; I am getting very good results. I don't think GPT-4 was multimodal like GPT-4o, so it must have used some other methodology?