-
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Paper • 2411.00225 • Published • 11 -
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Paper • 2410.22901 • Published • 8 -
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Paper • 2506.18898 • Published • 33
Zhongwei Zhang
zzwustc
AI & ML interests
AIGC
Recent Activity
upvoted
a
paper
5 days ago
FARMER: Flow AutoRegressive Transformer over Pixels
upvoted
a
paper
28 days ago
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
liked
a model
28 days ago
chetwinlow1/Ovi