Update README.md
README.md
CHANGED
@@ -40,12 +40,13 @@ The models are evaluated using open-ended and multiple-choice math problems from
 |---------------------------|---------------|-----------|-----------|-----------|
 | MAmmoTH-7B | **Hybrid** | 53.6 | 31.5 | 44.5 |
 | MAmmoTH-Coder-7B | **Hybrid** | 59.4 | 33.4 | 47.2 |
-| MetaMath-7B-Mistral | **CoT** |
+| MetaMath-7B-Mistral | **CoT** | 77.7 | 28.2 | 49.3 |
 | OpenChat-3.5-7B | **CoT** | 77.3 | 28.6 | 49.6 |
+| ChatGLM-3-6B | **CoT** | 72.3 | 25.7 | 45.6 |
 | DeepSeek-Coder-34B | **PoT** | 58.2 | 35.3 | 46.5 |
 | Grok-1 | **CoT** | 62.9 | 15.7 | - |
 | QWen-72B | **CoT** | 78.9 | 35.2 | - |
-|
+| DeepSeek-67B-Chat | **CoT** | **84.1** | 32.6 | - |
 | MAmmoTH-7B-Mistral | **Hybrid** | 75.0 | **40.0** | **52.5** |
 
 ## Usage