Xingxuan Li
commited on
Commit
·
72bf3da
1
Parent(s):
420f843
update readme
Browse files- .gitattributes +1 -0
- README.md +64 -1
- assets/7b_performance_training.png +3 -0
- assets/MiromindAI_H.svg +5 -0
.gitattributes
CHANGED
@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
|
|
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
37 |
+
assets/7b_performance_training.png filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -4,4 +4,67 @@ language:
|
|
4 |
- en
|
5 |
base_model:
|
6 |
- deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
|
7 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
4 |
- en
|
5 |
base_model:
|
6 |
- deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
|
7 |
+
---
|
8 |
+
|
9 |
+
<!-- markdownlint-disable first-line-h1 -->
|
10 |
+
<!-- markdownlint-disable html -->
|
11 |
+
<!-- markdownlint-disable no-duplicate-header -->
|
12 |
+
|
13 |
+
<div align="center">
|
14 |
+
<img src="assets/MiromindAI_H.svg" width="50%" alt="MiroMindM1" />
|
15 |
+
</div>
|
16 |
+
<!-- <hr> -->
|
17 |
+
<div align="center">
|
18 |
+
|
19 |
+
[](https://huggingface.co/collections/Skywork/skywork-or1-67fa1bcb41b436ef2def76b9)
|
20 |
+
[](https://huggingface.co/datasets/Skywork/Skywork-OR1-RL-Data)
|
21 |
+
[](https://github.com/XYaoooo/MiroMind-M1)
|
22 |
+
[](https://github.com/XYaoooo/MiroMind-M1)
|
23 |
+
[](https://miromind.ai/)
|
24 |
+
|
25 |
+
</div>
|
26 |
+
|
27 |
+
|
28 |
+
|
29 |
+
# MiroMind-M1
|
30 |
+
|
31 |
+
|
32 |
+
## 🧾 Overview
|
33 |
+
<div align="center">
|
34 |
+
<img src="assets/7b_performance_training.png" width="80%" alt="7B Model Training Performance" />
|
35 |
+
<p><i>Training performance of MiroMind-M1-RL-7B on AIME24 and AIME25.</i></p>
|
36 |
+
</div>
|
37 |
+
|
38 |
+
[One paragraph to introduce both MiroMind-M1 model series.]
|
39 |
+
|
40 |
+
|
41 |
+
## 📊 Evaluation
|
42 |
+
|
43 |
+
### MiroMind-M1-SFT
|
44 |
+
| Model | Initial Checkpoint | AIME24 (avg@64) | AIME25 (avg@64) | MATH500 (avg@5) |
|
45 |
+
|------------------|----------------------------|--------|--------|---------|
|
46 |
+
| DeepSeek-R1-Distill | Qwen2.5-Math-7B | 55.5 | 40.4† | 92.8 |
|
47 |
+
| OpenThoughts | Qwen2.5-7-Instruct | 31.3 | 23.3 | 83.2 |
|
48 |
+
| Open-R1 | Qwen2.5-Math-7B-Instruct | 36.7 | 40.0 | 90.6 |
|
49 |
+
| Synthetic-1 | Qwen2.5-7B-Instruct | 30.0 | 26.6 | 85.6 |
|
50 |
+
| **MiroMind-SFT-7B** | Qwen2.5-Math-7B | 60.4 | 45.0 | 94.6 |
|
51 |
+
|
52 |
+
*† means that the score of DeepSeek-R1 on AIME25 is from our evaluation.*
|
53 |
+
|
54 |
+
### MiroMind-M1-RL
|
55 |
+
| Model | AIME24 (avg@64) | AIME25 (avg@64) | MATH500 (avg@5) |
|
56 |
+
|----------------------------------|--------|--------|---------|
|
57 |
+
| DeepSeek-R1 | 79.8 | 70.0 | – |
|
58 |
+
| DeepSeek-R1-0528 | 91.4 | 87.5 | – |
|
59 |
+
| Qwen3-8B | 76.0 | 67.3 | – |
|
60 |
+
| DeepSeek-R1-0528-Qwen3-8B | 86.0 | 76.3 | – |
|
61 |
+
| <tr><td colspan="4" align="center"><em>**32B Models trained from Qwen2.5 series**</em></td></tr> |
|
62 |
+
| DeepSeek-R1-Distill-Qwen-32B | 70.8 | 52.1 | 95.8 |
|
63 |
+
| Skywork-OR1-32B-Preview | 77.1 | 68.2 | 97.5 |
|
64 |
+
| **MiroMind-M1-RL-32B** | 77.5 | 65.6 | 96.4 |
|
65 |
+
| <tr><td colspan="4" align="center"><em>**7B Models trained from Qwen2.5 series**</em></td></tr> |
|
66 |
+
| DeepSeek-R1-Distill-Qwen-7B | 55.5 | 39.2 | – |
|
67 |
+
| **MiroMind-M1-SFT-7B** | 60.4 | 45.0 | 94.6 |
|
68 |
+
| Light-R1-7B-DS | 59.1 | 44.3 | – |
|
69 |
+
| Skywork-OR1-7B | 72.2 | 54.6 | – |
|
70 |
+
| **MiroMind-M1-RL-7B** | 73.4 | 57.8 | 96.7 |
|
assets/7b_performance_training.png
ADDED
![]() |
Git LFS Details
|
assets/MiromindAI_H.svg
ADDED
|