- Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
  Paper • 2506.01943 • Published • 24
- LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks
  Paper • 2506.00411 • Published • 30
- SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics
  Paper • 2506.01844 • Published • 121
Ron Zhu
RzZ
AI & ML interests
None yet
Recent Activity
updated a collection 18 days ago: VLM
updated a collection about 1 month ago: VLM
liked a model about 1 month ago: Menlo/Jan-nano
Organizations
None yet