Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports Paper • 2603.09896 • Published 7 days ago • 25
Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders Paper • 2603.06569 • Published 11 days ago • 105
Article Demystifying Multimodal Learning: The Hidden Inefficiency in Vision Language Modelling 13 days ago • 4
Article Demystifying Multimodal Learning: Enabling Vision in Language Models 28 days ago • 4
Adapting Vision-Language Models for E-commerce Understanding at Scale Paper • 2602.11733 • Published Feb 12 • 13
Meeting SLOs, Slashing Hours: Automated Enterprise LLM Optimization with OptiKIT Paper • 2601.20408 • Published Jan 28 • 1
Self-Hinting Language Models Enhance Reinforcement Learning Paper • 2602.03143 • Published Feb 3 • 30
Vision Language Leaderboards Collection This collection has all the vision language leaderboards. • 7 items • Updated Aug 24, 2024 • 22
Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation Paper • 2505.06027 • Published May 9, 2025 • 18
LMMs-Eval: Reality Check on the Evaluation of Large Multimodal Models Paper • 2407.12772 • Published Jul 17, 2024 • 35
Sa2VA Model Zoo Collection Hugging Face Model Zoo For Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos By Bytedance Seed CV Research • 12 items • Updated Nov 27, 2025 • 45