Xingxuan Li commited on
Commit
72bf3da
·
1 Parent(s): 420f843

update readme

Browse files
.gitattributes CHANGED
@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
 
 
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
  tokenizer.json filter=lfs diff=lfs merge=lfs -text
37
+ assets/7b_performance_training.png filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -4,4 +4,67 @@ language:
4
  - en
5
  base_model:
6
  - deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
7
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
4
  - en
5
  base_model:
6
  - deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
7
+ ---
8
+
9
+ <!-- markdownlint-disable first-line-h1 -->
10
+ <!-- markdownlint-disable html -->
11
+ <!-- markdownlint-disable no-duplicate-header -->
12
+
13
+ <div align="center">
14
+ <img src="assets/MiromindAI_H.svg" width="50%" alt="MiroMindM1" />
15
+ </div>
16
+ <!-- <hr> -->
17
+ <div align="center">
18
+
19
+ [![Models](https://img.shields.io/badge/Models-5EDDD2?style=for-the-badge&logo=huggingface&logoColor=ffffff&labelColor)](https://huggingface.co/collections/Skywork/skywork-or1-67fa1bcb41b436ef2def76b9)
20
+ [![Data](https://img.shields.io/badge/Data-0040A1?style=for-the-badge&logo=huggingface&logoColor=ffffff&labelColor)](https://huggingface.co/datasets/Skywork/Skywork-OR1-RL-Data)
21
+ [![Paper](https://img.shields.io/badge/Code-000000?style=for-the-badge&logo=arxiv&logoColor=white)](https://github.com/XYaoooo/MiroMind-M1)
22
+ [![Github](https://img.shields.io/badge/Code-000000?style=for-the-badge&logo=github&logoColor=white)](https://github.com/XYaoooo/MiroMind-M1)
23
+ [![Website](https://img.shields.io/badge/Website-000000?style=for-the-badge&logo=google-chrome&logoColor=white)](https://miromind.ai/)
24
+
25
+ </div>
26
+
27
+
28
+
29
+ # MiroMind-M1
30
+
31
+
32
+ ## 🧾 Overview
33
+ <div align="center">
34
+ <img src="assets/7b_performance_training.png" width="80%" alt="7B Model Training Performance" />
35
+ <p><i>Training performance of MiroMind-M1-RL-7B on AIME24 and AIME25.</i></p>
36
+ </div>
37
+
38
+ [One paragraph to introduce both MiroMind-M1 model series.]
39
+
40
+
41
+ ## 📊 Evaluation
42
+
43
+ ### MiroMind-M1-SFT
44
+ | Model | Initial Checkpoint | AIME24 (avg@64) | AIME25 (avg@64) | MATH500 (avg@5) |
45
+ |------------------|----------------------------|--------|--------|---------|
46
+ | DeepSeek-R1-Distill | Qwen2.5-Math-7B | 55.5 | 40.4† | 92.8 |
47
+ | OpenThoughts | Qwen2.5-7-Instruct | 31.3 | 23.3 | 83.2 |
48
+ | Open-R1 | Qwen2.5-Math-7B-Instruct | 36.7 | 40.0 | 90.6 |
49
+ | Synthetic-1 | Qwen2.5-7B-Instruct | 30.0 | 26.6 | 85.6 |
50
+ | **MiroMind-SFT-7B** | Qwen2.5-Math-7B | 60.4 | 45.0 | 94.6 |
51
+
52
+ *† means that the score of DeepSeek-R1 on AIME25 is from our evaluation.*
53
+
54
+ ### MiroMind-M1-RL
55
+ | Model | AIME24 (avg@64) | AIME25 (avg@64) | MATH500 (avg@5) |
56
+ |----------------------------------|--------|--------|---------|
57
+ | DeepSeek-R1 | 79.8 | 70.0 | – |
58
+ | DeepSeek-R1-0528 | 91.4 | 87.5 | – |
59
+ | Qwen3-8B | 76.0 | 67.3 | – |
60
+ | DeepSeek-R1-0528-Qwen3-8B | 86.0 | 76.3 | – |
61
+ | <tr><td colspan="4" align="center"><em>**32B Models trained from Qwen2.5 series**</em></td></tr> |
62
+ | DeepSeek-R1-Distill-Qwen-32B | 70.8 | 52.1 | 95.8 |
63
+ | Skywork-OR1-32B-Preview | 77.1 | 68.2 | 97.5 |
64
+ | **MiroMind-M1-RL-32B** | 77.5 | 65.6 | 96.4 |
65
+ | <tr><td colspan="4" align="center"><em>**7B Models trained from Qwen2.5 series**</em></td></tr> |
66
+ | DeepSeek-R1-Distill-Qwen-7B | 55.5 | 39.2 | – |
67
+ | **MiroMind-M1-SFT-7B** | 60.4 | 45.0 | 94.6 |
68
+ | Light-R1-7B-DS | 59.1 | 44.3 | – |
69
+ | Skywork-OR1-7B | 72.2 | 54.6 | – |
70
+ | **MiroMind-M1-RL-7B** | 73.4 | 57.8 | 96.7 |
assets/7b_performance_training.png ADDED

Git LFS Details

  • SHA256: 9a6e5f0aca86594a04c592cd88f904869247fc3b48b97e6356089659c0aea537
  • Pointer size: 131 Bytes
  • Size of remote file: 195 kB
assets/MiromindAI_H.svg ADDED