Hugging Face reposted this
Are the latest VLM-Based OCR models better than "traditional" OCR systems? With new vision-language models for OCR dropping almost weekly, I wanted to create an easier way for GLAM professionals to evaluate/vibe check how existing OCR compares to newer VLM-based OCR. I previously shared a space which allowed you to upload your own images for testing, but I think it could be more useful to compare results across a larger number of images. To help with this, I've built OCR Time Capsule - a simple comparison tool using 11,000+Scottish school exam papers (1888-1963) from the National Library of Scotland as a test case. 📚 Dataset: https://lnkd.in/eWQBK8FZ 🔗 Browse Results: https://lnkd.in/eyX4zJhK 🔧 Process Your Own: https://lnkd.in/eq2U2F_q Key Features: - Visual page browser to quickly scan through documents - Side-by-side comparison of XML OCR vs VLM output - Quality metrics showing character-level improvements - Export functionality for further analysis 🚀 Next Steps: I'm planning to add more example datasets & OCR models using HF Jobs. Feel free to suggest collections to test with - I need image + existing OCR! Even better: if your institution has digitised collections, consider uploading them to Hugging Face! Would love to see more GLAM datasets on the Hub 🤗 Drop a comment with dataset suggestions or links! #DigitalLibraries #OCR #GLAM #DigitalHumanities #AI