PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published 17 days ago • 78 • 5
Doc2Page - Document to Webpage Converter 🏄 Convert docs to webpages using PaddleOCR and ERNIE
Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs Paper • 2510.01954 • Published Oct 2 • 12
Unleashing the Full Potential of ERNIE4.5 using FastDeploy Article • By baidu and 3 others • Sep 19 • 11
PP-OCRv5 on Hugging Face: A Specialized Approach to OCR Article • By baidu and 5 others • Sep 10 • 108
CompassJudger-1: All-in-one Judge Model Helps Model Evaluation and Evolution Paper • 2410.16256 • Published Oct 21, 2024 • 60