Time Blindness: Why Video-Language Models Can't See What Humans Can? Paper • 2505.24867 • Published May 30 • 80
Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark Paper • 2503.20786 • Published Mar 26 • 2
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding Paper • 2502.14949 • Published Feb 20 • 9