Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
AI & ML interests
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
Recent Activity
View all activity
Organization Card
š„š„ Introducing MMR1 ā a Multimodal Reasoning Model trained with Variance-Aware Sampling (VAS)
š” Highlights
- Variance-Aware Sampling (VAS) for multimodal RL training:
- Establishes a theoretical link between reward variance and gradient signal strength;
- Proposes the Variance Promotion Score (VPS) integrating Outcome Variance and Trajectory Diversity;
- Enables more efficient and stable optimization under limited data conditions.
- Open-sources ~1.6M Long-CoT cold-start samples, annotated by Gemini 2.5 Pro/Flash and verified with GPT-4o.
- Releases a suite of SFT and RL checkpoints at multiple scales: 3B, 7B, and 32B variants.
š¦ Resources
- š Paper: MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
- š Model Checkpoints (SFT & RL):
- MMR1-3B-SFT | MMR1-3B-RL
- MMR1-7B-SFT | MMR1-7B-RL
- MMR1-32B-SFT | MMR1-32B-RL coming soon!
- š Datasets: MMR1-SFT, MMR1-RL
- š» Code: GitHub - MMR1
š Citation
If you find MMR1 useful for your research and applications, please cite using this BibTeX:
@misc{leng2025mmr1,
title={MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources},
author={Sicong Leng and Jing Wang and Jiaxi Li and Hao Zhang and Zhiqiang Hu and Boqiang Zhang and Yuming Jiang and Hang Zhang and Xin Li and Lidong Bing and Deli Zhao and Wei Lu and Yu Rong and Aixin Sun and Shijian Lu},
year={2025},
eprint={2509.21268},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2509.21268},
}
models
6

MMR1/MMR1-32B-SFT
Image-Text-to-Text
ā¢
0.0B
ā¢
Updated
ā¢
9

MMR1/MMR1-7B-SFT
Image-Text-to-Text
ā¢
8B
ā¢
Updated
ā¢
14

MMR1/MMR1-3B-SFT
Image-Text-to-Text
ā¢
4B
ā¢
Updated
ā¢
16

MMR1/MMR1-3B-RL
Image-Text-to-Text
ā¢
4B
ā¢
Updated
ā¢
14

MMR1/MMR1-7B-RL
Image-Text-to-Text
ā¢
8B
ā¢
Updated
ā¢
20

MMR1/MMR1-Math-v0-7B
Image-Text-to-Text
ā¢
8B
ā¢
Updated
ā¢
102
ā¢
6