Misraj Open Data Collection This collection contain an open source data has been collected and processed by Misraj team • 3 items • Updated 24 days ago • 6
Mutarjim: Advancing Bidirectional Arabic-English Translation with a Small Language Model Paper • 2505.17894 • Published May 23 • 219
KITAB-Bench Collection A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding • 24 items • Updated Feb 24 • 14
VLM-R1 Collection Multimodal Reasoning Dataset for Large Scale Training with DeepSeek-R1 thoughts style • 18 items • Updated Apr 14 • 1
Sadeed: Advancing Arabic Diacritization Through Small Language Model Paper • 2504.21635 • Published Apr 30 • 59
Sadeed: Advancing Arabic Diacritization Through Small Language Model Paper • 2504.21635 • Published Apr 30 • 59