RLVLA
This is the RL model, fine-tuned from the warmed-up OpenVLA model. RL training takes about 1.5M environment steps. For more details, please refer to the codebase and the paper.