-
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Paper • 2411.00225 • Published • 11 -
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Paper • 2410.22901 • Published • 8 -
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Paper • 2506.18898 • Published • 33
Zhongwei Zhang
zzwustc
AI & ML interests
AIGC
Recent Activity
upvoted
a
paper
17 days ago
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation
liked
a model
17 days ago
chetwinlow1/Ovi
liked
a Space
about 1 month ago
finegrain/finegrain-object-eraser