Standard datasets can no longer be used for benchmarking against LLMs since they...

47282847 9 months ago | parent | context | favorite | on: Show HN: LLM-aided OCR – Correcting Tesseract OCR ...

Standard datasets can no longer be used for benchmarking against LLMs since they have already been fed into it and are thus too well-known to compare to lesser known documents.