arxiv:2508.19205
zhiliang
zzliang
AI & ML interests
multimodal
Recent Activity
authored
a paper
about 2 months ago
Generic-to-Specific Distillation of Masked Autoencoders
authored
a paper
about 2 months ago
Kosmos-G: Generating Images in Context with Multimodal Large Language
Models
authored
a paper
about 2 months ago
Integrally Migrating Pre-trained Transformer Encoder-decoders for Visual
Object Detection