Xizhou Zhu
Einsiedler
Recent Activity
authored a paper 7 days ago
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models
authored a paper 3 months ago
VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models
authored a paper 4 months ago
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy