arXiv:2511.01678
Rui Zhao
ruizhaocv
AI & ML interests
Multimodal and GenAI
Recent Activity
upvoted a paper about 17 hours ago
Grounding Computer Use Agents on Human Demonstrations
upvoted a paper 8 days ago
VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation
authored a paper 9 days ago
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback