LongVie 2: Multimodal Controllable Ultra-Long Video World Model Paper • 2512.13604 • Published 27 days ago • 73
EgoX: Egocentric Video Generation from a Single Exocentric Video Paper • 2512.08269 • Published Dec 9, 2025 • 117
What about gravity in video generation? Post-Training Newton's Laws with Verifiable Rewards Paper • 2512.00425 • Published Nov 29, 2025 • 50
Guided Self-Evolving LLMs with Minimal Human Supervision Paper • 2512.02472 • Published Dec 2, 2025 • 51
How Far Are We from Genuinely Useful Deep Research Agents? Paper • 2512.01948 • Published Dec 1, 2025 • 54
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 72
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published Nov 27, 2025 • 87
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration Paper • 2511.21689 • Published Nov 26, 2025 • 114
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle Paper • 2512.04324 • Published Dec 3, 2025 • 150
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling Paper • 2511.20785 • Published Nov 25, 2025 • 182
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published Dec 4, 2025 • 167
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published Nov 27, 2025 • 226
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 249
EditThinker: Unlocking Iterative Reasoning for Any Image Editor Paper • 2512.05965 • Published Dec 5, 2025 • 38
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation Paper • 2512.07831 • Published Dec 8, 2025 • 16