1 28 218

Xi Yang

ianyeung

IanYeung

AI & ML interests

None yet

Recent Activity

liked a model about 4 hours ago

ByteDance/InfiniteYou

liked a model about 5 hours ago

internlm/Intern-S1-FP8

liked a model 3 days ago

zai-org/GLM-4.5

View all activity

Organizations

None yet

upvoted a paper 23 days ago

StreamDiT: Real-Time Streaming Text-to-Video Generation

Paper • 2507.03745 • Published 27 days ago • 28

upvoted a collection about 1 month ago

Kimi-VL-A3B

Collection

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 7 items • Updated about 1 month ago • 71

upvoted an article about 1 month ago

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

•

Jan 30

• 105

upvoted a paper about 1 month ago

JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Paper • 2506.17612 • Published Jun 21 • 61

upvoted an article about 1 month ago

Article

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

and 4 others •

Jun 19

• 81

upvoted a paper about 1 month ago

Sekai: A Video Dataset towards World Exploration

Paper • 2506.15675 • Published Jun 18 • 65

upvoted 3 papers about 2 months ago

Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation

Paper • 2506.09350 • Published Jun 11 • 48

Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion

Paper • 2506.08009 • Published Jun 9 • 26

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 98

upvoted 3 papers 3 months ago

PrimitiveAnything: Human-Crafted 3D Primitive Assembly Generation with Auto-Regressive Transformer

Paper • 2505.04622 • Published May 7 • 27

HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation

Paper • 2505.04512 • Published May 7 • 36

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24 • 93

upvoted a paper 4 months ago

Lumina-Image 2.0: A Unified and Efficient Image Generative Framework

Paper • 2503.21758 • Published Mar 27 • 22

upvoted 7 papers 5 months ago

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16 • 44

DiT-Air: Revisiting the Efficiency of Diffusion Model Architecture Design in Text to Image Generation

Paper • 2503.10618 • Published Mar 13 • 18

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published Mar 11 • 68

Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model

Paper • 2503.07703 • Published Mar 10 • 36

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Paper • 2503.07027 • Published Mar 10 • 29

VideoPainter: Any-length Video Inpainting and Editing with Plug-and-Play Context Control

Paper • 2503.05639 • Published Mar 7 • 24

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control

Paper • 2503.03751 • Published Mar 5 • 22