Update README.md
Browse files
README.md
CHANGED
|
@@ -38,7 +38,7 @@ GOVERNING TERMS: Your use of this model is governed by the [NVIDIA Open Model Li
|
|
| 38 |
|
| 39 |
**Model Developer:** NVIDIA
|
| 40 |
|
| 41 |
-
**Model Dates:** Trained between August 2024 and
|
| 42 |
|
| 43 |
**Data Freshness:** The pretraining data has a cutoff of 2023 per Meta Llama 3.1 8B
|
| 44 |
|
|
@@ -48,11 +48,11 @@ GOVERNING TERMS: Your use of this model is governed by the [NVIDIA Open Model Li
|
|
| 48 |
Developers designing AI Agent systems, chatbots, RAG systems, and other AI-powered applications. Also suitable for typical instruction-following tasks. Balance of model accuracy and compute efficiency (the model fits on a single RTX GPU and can be used locally).
|
| 49 |
|
| 50 |
## Release Date: <br>
|
| 51 |
-
|
| 52 |
|
| 53 |
## References
|
| 54 |
|
| 55 |
-
- [\[
|
| 56 |
|
| 57 |
|
| 58 |
## Model Architecture
|
|
@@ -223,16 +223,16 @@ These results contain both “Reasoning On”, and “Reasoning Off”. We recom
|
|
| 223 |
|
| 224 |
| Reasoning Mode | Score |
|
| 225 |
|--------------|------------|
|
| 226 |
-
| Reasoning Off | 7.
|
| 227 |
-
| Reasoning On | 8.
|
| 228 |
|
| 229 |
|
| 230 |
### MATH500
|
| 231 |
|
| 232 |
| Reasoning Mode | pass@1 |
|
| 233 |
|--------------|------------|
|
| 234 |
-
| Reasoning Off |
|
| 235 |
-
| Reasoning On |
|
| 236 |
|
| 237 |
User Prompt Template:
|
| 238 |
|
|
@@ -246,7 +246,7 @@ User Prompt Template:
|
|
| 246 |
| Reasoning Mode | pass@1 |
|
| 247 |
|--------------|------------|
|
| 248 |
| Reasoning Off | 13.3% |
|
| 249 |
-
| Reasoning On | 46.
|
| 250 |
|
| 251 |
User Prompt Template:
|
| 252 |
|
|
@@ -259,8 +259,8 @@ User Prompt Template:
|
|
| 259 |
|
| 260 |
| Reasoning Mode | pass@1 |
|
| 261 |
|--------------|------------|
|
| 262 |
-
| Reasoning Off |
|
| 263 |
-
| Reasoning On | 55.
|
| 264 |
|
| 265 |
User Prompt Template:
|
| 266 |
|
|
@@ -274,15 +274,15 @@ User Prompt Template:
|
|
| 274 |
|
| 275 |
| Reasoning Mode | Strict:Prompt | Strict:Instruction |
|
| 276 |
|--------------|------------|------------|
|
| 277 |
-
| Reasoning Off |
|
| 278 |
-
| Reasoning On | 75.
|
| 279 |
|
| 280 |
### BFCL v2 Live
|
| 281 |
|
| 282 |
| Reasoning Mode | Score |
|
| 283 |
|--------------|------------|
|
| 284 |
-
| Reasoning Off |
|
| 285 |
-
| Reasoning On |
|
| 286 |
|
| 287 |
User Prompt Template:
|
| 288 |
|
|
@@ -298,8 +298,8 @@ User Prompt Template:
|
|
| 298 |
|
| 299 |
| Reasoning Mode | pass@1 |
|
| 300 |
|--------------|------------|
|
| 301 |
-
| Reasoning Off |
|
| 302 |
-
| Reasoning On |
|
| 303 |
|
| 304 |
User Prompt Template:
|
| 305 |
|
|
|
|
| 38 |
|
| 39 |
**Model Developer:** NVIDIA
|
| 40 |
|
| 41 |
+
**Model Dates:** Trained between August 2024 and May 2025
|
| 42 |
|
| 43 |
**Data Freshness:** The pretraining data has a cutoff of 2023 per Meta Llama 3.1 8B
|
| 44 |
|
|
|
|
| 48 |
Developers designing AI Agent systems, chatbots, RAG systems, and other AI-powered applications. Also suitable for typical instruction-following tasks. Balance of model accuracy and compute efficiency (the model fits on a single RTX GPU and can be used locally).
|
| 49 |
|
| 50 |
## Release Date: <br>
|
| 51 |
+
5/19/2025 <br>
|
| 52 |
|
| 53 |
## References
|
| 54 |
|
| 55 |
+
- [\[2505.00949\] Llama-Nemotron: Efficient Reasoning Models](https://arxiv.org/abs/2505.00949)
|
| 56 |
|
| 57 |
|
| 58 |
## Model Architecture
|
|
|
|
| 223 |
|
| 224 |
| Reasoning Mode | Score |
|
| 225 |
|--------------|------------|
|
| 226 |
+
| Reasoning Off | 7.4 |
|
| 227 |
+
| Reasoning On | 8.0 |
|
| 228 |
|
| 229 |
|
| 230 |
### MATH500
|
| 231 |
|
| 232 |
| Reasoning Mode | pass@1 |
|
| 233 |
|--------------|------------|
|
| 234 |
+
| Reasoning Off | 71.8% |
|
| 235 |
+
| Reasoning On | 96.2% |
|
| 236 |
|
| 237 |
User Prompt Template:
|
| 238 |
|
|
|
|
| 246 |
| Reasoning Mode | pass@1 |
|
| 247 |
|--------------|------------|
|
| 248 |
| Reasoning Off | 13.3% |
|
| 249 |
+
| Reasoning On | 46.3% |
|
| 250 |
|
| 251 |
User Prompt Template:
|
| 252 |
|
|
|
|
| 259 |
|
| 260 |
| Reasoning Mode | pass@1 |
|
| 261 |
|--------------|------------|
|
| 262 |
+
| Reasoning Off | 33.8% |
|
| 263 |
+
| Reasoning On | 55.1% |
|
| 264 |
|
| 265 |
User Prompt Template:
|
| 266 |
|
|
|
|
| 274 |
|
| 275 |
| Reasoning Mode | Strict:Prompt | Strict:Instruction |
|
| 276 |
|--------------|------------|------------|
|
| 277 |
+
| Reasoning Off | 70.1% | 78.5% |
|
| 278 |
+
| Reasoning On | 75.5% | 82.6% |
|
| 279 |
|
| 280 |
### BFCL v2 Live
|
| 281 |
|
| 282 |
| Reasoning Mode | Score |
|
| 283 |
|--------------|------------|
|
| 284 |
+
| Reasoning Off | 63.6% |
|
| 285 |
+
| Reasoning On | 67.9% |
|
| 286 |
|
| 287 |
User Prompt Template:
|
| 288 |
|
|
|
|
| 298 |
|
| 299 |
| Reasoning Mode | pass@1 |
|
| 300 |
|--------------|------------|
|
| 301 |
+
| Reasoning Off | 61.9% |
|
| 302 |
+
| Reasoning On | 85.8% |
|
| 303 |
|
| 304 |
User Prompt Template:
|
| 305 |
|