Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
prithivMLmodsΒ 
posted an update 4 days ago
Post
2309
olmOCR [Allen AI] just got an upgrade! πŸ“ˆπŸ§‘β€πŸ³

The allenai/olmOCR-7B-0725 β€” fine-tuned with allenai/olmOCR-mix-0225 on top of Qwen/Qwen2.5-VL-7B-Instruct, pushing the boundaries of OCR technology. It takes a single document image as input, with the longest side resized to 1288 pixels. High-quality, openly available approach to parsing pdfs and other complex documents optical character recognition.

Try the demo here: prithivMLmods/Multimodal-OCR

✨ Model: allenai/olmOCR-7B-0725
✨ Model [fp8]: allenai/olmOCR-7B-0725-FP8
✨ Multimodal Implementations Space Collection: prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0

.
.
.
To know more about it, visit the model card of the respective model. !!
In this post