SeqTex: Generate Mesh Textures in Video Sequence Paper β’ 2507.04285 β’ Published 23 days ago β’ 8
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Paper β’ 2506.04225 β’ Published Jun 4 β’ 25
Vid2World: Crafting Video Diffusion Models to Interactive World Models Paper β’ 2505.14357 β’ Published May 20 β’ 27
Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models Paper β’ 2503.01774 β’ Published Mar 3 β’ 45
ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features Paper β’ 2502.04320 β’ Published Feb 6 β’ 38
VideoRoPE: What Makes for Good Video Rotary Position Embedding? Paper β’ 2502.05173 β’ Published Feb 7 β’ 65
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Paper β’ 2412.09596 β’ Published Dec 12, 2024 β’ 99
LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences Paper β’ 2412.01292 β’ Published Dec 2, 2024 β’ 13
NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images Paper β’ 2412.03517 β’ Published Dec 4, 2024 β’ 19
4Real-Video: Learning Generalizable Photo-Realistic 4D Video Diffusion Paper β’ 2412.04462 β’ Published Dec 5, 2024 β’ 8