MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper • 2501.12380 • Published Jan 21 • 86
Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers Paper • 2305.14987 • Published May 24, 2023 • 1
Visual Question Decomposition on Multimodal Large Language Models Paper • 2409.19339 • Published Sep 28, 2024 • 9