DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning Paper • 2603.12257 • Published 16 days ago • 31
Accelerating Masked Image Generation by Learning Latent Controlled Dynamics Paper • 2602.23996 • Published 29 days ago • 8
WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation Paper • 2503.19065 • Published Mar 24, 2025 • 11
From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning Paper • 2504.16080 • Published Apr 22, 2025 • 15
From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors Paper • 2602.21778 • Published Feb 25 • 14
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture Paper • 2512.21675 • Published Dec 25, 2025 • 25
Omni-Weather: Unified Multimodal Foundation Model for Weather Generation and Understanding Paper • 2512.21643 • Published Dec 25, 2025 • 13
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision Paper • 2504.04903 • Published Apr 7, 2025
Factuality Matters: When Image Generation and Editing Meet Structured Visuals Paper • 2510.05091 • Published Oct 6, 2025 • 20
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding Paper • 2510.06308 • Published Oct 7, 2025 • 55
PICABench: How Far Are We from Physically Realistic Image Editing? Paper • 2510.17681 • Published Oct 20, 2025 • 65
TGDPO: Harnessing Token-Level Reward Guidance for Enhancing Direct Preference Optimization Paper • 2506.14574 • Published Jun 17, 2025 • 1
Animate-X++: Universal Character Image Animation with Dynamic Backgrounds Paper • 2508.09454 • Published Aug 13, 2025
From Noisy Traces to Stable Gradients: Bias-Variance Optimized Preference Optimization for Aligning Large Reasoning Models Paper • 2510.05095 • Published Oct 6, 2025 • 1
Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents Paper • 2510.06214 • Published Oct 7, 2025 • 1