AV-Deepfake1M++: A Large-Scale Audio-Visual Deepfake Benchmark with Real-World Perturbations Paper • 2507.20579 • Published Jul 28
GeoChat: Grounded Large Vision-Language Model for Remote Sensing Paper • 2311.15826 • Published Nov 24, 2023
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages Paper • 2411.16508 • Published Nov 25, 2024 • 12
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks Paper • 2411.19325 • Published Nov 28, 2024