Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training • Paper • 2509.26625 • Published Sep 30, 2025 • 43
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations • Paper • 2506.18898 • Published Jun 23, 2025 • 33
Seedance 1.0: Exploring the Boundaries of Video Generation Models • Paper • 2506.09113 • Published Jun 10, 2025 • 105
Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation • Paper • 2506.09350 • Published Jun 11, 2025 • 48
ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions • Paper • 2506.03107 • Published Jun 3, 2025 • 2 • 2
ByteMorph • Collection • Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions • 6 items • Updated Jun 3, 2025 • 1