GenEvolve: Self-Evolving Image Generation Agents via Tool-Orchestrated Visual Experience Distillation Paper • 2605.21605 • Published 4 days ago • 10
Lance: Unified Multimodal Modeling by Multi-Task Synergy Paper • 2605.18678 • Published 6 days ago • 71
LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation Paper • 2605.18739 • Published 6 days ago • 108
NEWTON: Agentic Planning for Physically Grounded Video Generation Paper • 2605.18396 • Published 6 days ago • 22
FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization Paper • 2605.15824 • Published 9 days ago • 59
Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation Paper • 2605.15141 • Published 10 days ago • 91
SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 10 days ago • 80
CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives Paper • 2605.12496 • Published 12 days ago • 28
World Action Models: The Next Frontier in Embodied AI Paper • 2605.12090 • Published 12 days ago • 64
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture Paper • 2605.12500 • Published 12 days ago • 185
WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors Paper • 2605.10434 • Published 13 days ago • 29
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation Paper • 2605.03849 • Published 19 days ago • 124
Lightning Unified Video Editing via In-Context Sparse Attention Paper • 2605.04569 • Published 18 days ago • 18
D-OPSD: On-Policy Self-Distillation for Continuously Tuning Step-Distilled Diffusion Models Paper • 2605.05204 • Published 18 days ago • 27
ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models Paper • 2405.13729 • Published 25 days ago • 13
Meta-CoT: Enhancing Granularity and Generalization in Image Editing Paper • 2604.24625 • Published 27 days ago • 26
World-R1: Reinforcing 3D Constraints for Text-to-Video Generation Paper • 2604.24764 • Published 27 days ago • 118