update readme

Files changed (4) hide show

.gitattributes +1 -0
README.md +64 -1
assets/7b_performance_training.png +3 -0
assets/MiromindAI_H.svg +5 -0

.gitattributes CHANGED Viewed

@@ -34,3 +34,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 tokenizer.json filter=lfs diff=lfs merge=lfs -text

 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
 tokenizer.json filter=lfs diff=lfs merge=lfs -text
+assets/7b_performance_training.png filter=lfs diff=lfs merge=lfs -text

README.md CHANGED Viewed

@@ -4,4 +4,67 @@ language:
 - en
 base_model:
 - deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
----

 - en
 base_model:
 - deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
+---
+<!-- markdownlint-disable first-line-h1 -->
+<!-- markdownlint-disable html -->
+<!-- markdownlint-disable no-duplicate-header -->
+<div align="center">
+  <img src="assets/MiromindAI_H.svg" width="50%" alt="MiroMindM1" />
+</div>
+<!-- <hr> -->
+<div align="center">
+[![Models](https://img.shields.io/badge/Models-5EDDD2?style=for-the-badge&logo=huggingface&logoColor=ffffff&labelColor)](https://huggingface.co/collections/Skywork/skywork-or1-67fa1bcb41b436ef2def76b9)
+[![Data](https://img.shields.io/badge/Data-0040A1?style=for-the-badge&logo=huggingface&logoColor=ffffff&labelColor)](https://huggingface.co/datasets/Skywork/Skywork-OR1-RL-Data)
+[![Paper](https://img.shields.io/badge/Code-000000?style=for-the-badge&logo=arxiv&logoColor=white)](https://github.com/XYaoooo/MiroMind-M1)
+[![Github](https://img.shields.io/badge/Code-000000?style=for-the-badge&logo=github&logoColor=white)](https://github.com/XYaoooo/MiroMind-M1)
+[![Website](https://img.shields.io/badge/Website-000000?style=for-the-badge&logo=google-chrome&logoColor=white)](https://miromind.ai/)
+</div>
+# MiroMind-M1
+## 🧾 Overview
+<div align="center">
+  <img src="assets/7b_performance_training.png" width="80%" alt="7B Model Training Performance" />
+  <p><i>Training performance of MiroMind-M1-RL-7B on AIME24 and AIME25.</i></p>
+</div>
+[One paragraph to introduce both MiroMind-M1 model series.]
+## 📊 Evaluation
+### MiroMind-M1-SFT
+| Model           | Initial Checkpoint         | AIME24 (avg@64) | AIME25 (avg@64) | MATH500 (avg@5) |
+|------------------|----------------------------|--------|--------|---------|
+| DeepSeek-R1-Distill                  | Qwen2.5-Math-7B             | 55.5   | 40.4†  | 92.8    |
+| OpenThoughts                         | Qwen2.5-7-Instruct           | 31.3   | 23.3   | 83.2    |
+| Open-R1                              | Qwen2.5-Math-7B-Instruct     | 36.7   | 40.0   | 90.6    |
+| Synthetic-1                          | Qwen2.5-7B-Instruct          | 30.0   | 26.6   | 85.6    |
+| **MiroMind-SFT-7B**                  | Qwen2.5-Math-7B             | 60.4   | 45.0   | 94.6    |
+*† means that the score of DeepSeek-R1 on AIME25 is from our evaluation.*
+### MiroMind-M1-RL
+| Model                            | AIME24 (avg@64) | AIME25 (avg@64) | MATH500 (avg@5) |
+|----------------------------------|--------|--------|---------|
+| DeepSeek-R1                      | 79.8   | 70.0   | –       |
+| DeepSeek-R1-0528                 | 91.4   | 87.5   | –       |
+| Qwen3-8B                         | 76.0   | 67.3   | –       |
+| DeepSeek-R1-0528-Qwen3-8B        | 86.0   | 76.3   | –       |
+| <tr><td colspan="4" align="center"><em>**32B Models trained from Qwen2.5 series**</em></td></tr> |
+| DeepSeek-R1-Distill-Qwen-32B     | 70.8   | 52.1   | 95.8    |
+| Skywork-OR1-32B-Preview          | 77.1   | 68.2   | 97.5    |
+| **MiroMind-M1-RL-32B**           | 77.5   | 65.6   | 96.4    |
+| <tr><td colspan="4" align="center"><em>**7B Models trained from Qwen2.5 series**</em></td></tr> |
+| DeepSeek-R1-Distill-Qwen-7B      | 55.5   | 39.2   | –       |
+| **MiroMind-M1-SFT-7B**           | 60.4   | 45.0   | 94.6    |
+| Light-R1-7B-DS                   | 59.1   | 44.3   | –       |
+| Skywork-OR1-7B                   | 72.2   | 54.6   | –       |
+| **MiroMind-M1-RL-7B**            | 73.4   | 57.8   | 96.7    |

assets/7b_performance_training.png ADDED Viewed

Git LFS Details

SHA256: 9a6e5f0aca86594a04c592cd88f904869247fc3b48b97e6356089659c0aea537
Pointer size: 131 Bytes
Size of remote file: 195 kB

assets/MiromindAI_H.svg ADDED Viewed