InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems Paper • 2506.16381 • Published Jun 19 • 2
UniF^2ace: Fine-grained Face Understanding and Generation with Unified Multimodal Models Paper • 2503.08120 • Published Mar 11 • 32
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics Paper • 2501.04686 • Published Jan 8 • 53
Flames: Benchmarking Value Alignment of LLMs in Chinese Paper • 2311.06899 • Published Nov 12, 2023 • 2