Manipulation
Collection
Manipulation-related datasets and models (15 items)
InternVLA-M1 is an open-source, end-to-end vision–language–action (VLA) framework for building and researching generalist robot policies.
action_chunk: 8
batch_size: 128
training_steps: 30k
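The three settings above can be read into typed values to get a rough sense of training scale. The following is a minimal sketch, not part of InternVLA-M1: `parse_count`, `TrainSettings`, and `load_settings` are hypothetical names. It assumes that `30k` means 30,000 optimizer steps, that `batch_size` is the global batch size, and that `action_chunk` is the number of actions predicted per sample.

```python
from dataclasses import dataclass


def parse_count(text: str) -> int:
    """Parse an integer that may carry a 'k' (thousand) or 'm' (million) suffix."""
    text = text.strip().lower()
    multipliers = {"k": 1_000, "m": 1_000_000}
    if text and text[-1] in multipliers:
        return int(float(text[:-1]) * multipliers[text[-1]])
    return int(text)


@dataclass(frozen=True)
class TrainSettings:
    """Hypothetical container for the three listed hyperparameters."""
    action_chunk: int    # assumed: actions predicted per sample
    batch_size: int      # assumed: global batch size
    training_steps: int  # assumed: optimizer steps

    @property
    def samples_seen(self) -> int:
        # Total training samples consumed over the whole run.
        return self.batch_size * self.training_steps

    @property
    def actions_per_batch(self) -> int:
        # Action targets supervised in a single batch.
        return self.batch_size * self.action_chunk


def load_settings(lines):
    """Build TrainSettings from 'key: value' lines."""
    raw = {}
    for line in lines:
        key, _, value = line.partition(":")
        raw[key.strip()] = value.strip()
    return TrainSettings(
        action_chunk=parse_count(raw["action_chunk"]),
        batch_size=parse_count(raw["batch_size"]),
        training_steps=parse_count(raw["training_steps"]),
    )


settings = load_settings(
    ["action_chunk: 8", "batch_size: 128", "training_steps: 30k"]
)
print(settings.training_steps)
print(settings.samples_seen)
print(settings.actions_per_batch)
```

Under these assumptions, the run would consume 128 × 30,000 = 3,840,000 samples, with 128 × 8 = 1,024 action targets per batch.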
@misc{internvla2024,
  title        = {InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy},
  author       = {InternVLA-M1 Contributors},
  year         = {2025},
  howpublished = {arXiv},
}