MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset Paper • 2406.14272 • Published Jun 20, 2024 • 2
AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models Paper • 2410.18325 • Published Oct 23, 2024 • 1
Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert Paper • 2407.01034 • Published Jul 1, 2024 • 1
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics Paper • 2503.20308 • Published Mar 26, 2025 • 23