ProactiveBench: A Comprehensive Benchmark Evaluating Proactive Interactions in Video Large Language Models Paper • 2507.09313 • Published 19 days ago • 1
HawkEye: Training Video-Text LLMs for Grounding Text in Videos Paper • 2403.10228 • Published Mar 15, 2024 • 1
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding Paper • 2412.17295 • Published Dec 23, 2024 • 9