Post
2309
olmOCR [Allen AI] just got an upgrade! ππ§βπ³
The allenai/olmOCR-7B-0725 β fine-tuned with allenai/olmOCR-mix-0225 on top of Qwen/Qwen2.5-VL-7B-Instruct, pushing the boundaries of OCR technology. It takes a single document image as input, with the longest side resized to 1288 pixels. High-quality, openly available approach to parsing pdfs and other complex documents optical character recognition.
Try the demo here: prithivMLmods/Multimodal-OCR
β¨ Model: allenai/olmOCR-7B-0725
β¨ Model [fp8]: allenai/olmOCR-7B-0725-FP8
β¨ Multimodal Implementations Space Collection: prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
.
.
.
To know more about it, visit the model card of the respective model. !!
The allenai/olmOCR-7B-0725 β fine-tuned with allenai/olmOCR-mix-0225 on top of Qwen/Qwen2.5-VL-7B-Instruct, pushing the boundaries of OCR technology. It takes a single document image as input, with the longest side resized to 1288 pixels. High-quality, openly available approach to parsing pdfs and other complex documents optical character recognition.
Try the demo here: prithivMLmods/Multimodal-OCR
β¨ Model: allenai/olmOCR-7B-0725
β¨ Model [fp8]: allenai/olmOCR-7B-0725-FP8
β¨ Multimodal Implementations Space Collection: prithivMLmods/multimodal-implementations-67c9982ea04b39f0608badb0
.
.
.
To know more about it, visit the model card of the respective model. !!