VLA-RL-Study: What Can RL Bring to VLA Generalization? An Empirical Study

arXiv Website

This is the RL model, fine-tuned from the warm-upped OpenVLA model. The RL training takes about 1.5M environment steps. For more details, please refer to the codebase and the paper.

Downloads last month
47
Safetensors
Model size
7.54B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for gen-robot/openvla-7b-rlvla-rl

Base model

openvla/openvla-7b
Finetuned
(2)
this model

Collection including gen-robot/openvla-7b-rlvla-rl