-
Fashion-VDM: Video Diffusion Model for Virtual Try-On
Paper • 2411.00225 • Published • 11 -
HelloMeme: Integrating Spatial Knitting Attentions to Embed High-Level and Fidelity-Rich Conditions in Diffusion Models
Paper • 2410.22901 • Published • 8 -
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations
Paper • 2506.18898 • Published • 32
Zhongwei Zhang
zzwustc
AI & ML interests
AIGC
Recent Activity
upvoted
an
article
about 4 hours ago
You could have designed state of the art positional encoding
liked
a model
18 days ago
ByteDance/Sa2VA-4B
liked
a Space
28 days ago
OmniGen2/OmniGen2