A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.
Ray Yang
rayruiyang
AI & ML interests
None yet
Recent Activity
upvoted a paper 10 days ago
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds
updated a model 12 days ago
rayruiyang/VST-3B-SFT
updated a model 12 days ago
rayruiyang/VST-3B-RL
Organizations
None yet