Text Generation
Transformers
Safetensors
PyTorch
English
llama
nvidia
llama-3
conversational
text-generation-inference
suhara commited on
Commit
a3ff62e
·
verified ·
1 Parent(s): ed22ac2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -16
README.md CHANGED
@@ -38,7 +38,7 @@ GOVERNING TERMS: Your use of this model is governed by the [NVIDIA Open Model Li
38
 
39
  **Model Developer:** NVIDIA
40
 
41
- **Model Dates:** Trained between August 2024 and April 2025
42
 
43
  **Data Freshness:** The pretraining data has a cutoff of 2023 per Meta Llama 3.1 8B
44
 
@@ -48,11 +48,11 @@ GOVERNING TERMS: Your use of this model is governed by the [NVIDIA Open Model Li
48
  Developers designing AI Agent systems, chatbots, RAG systems, and other AI-powered applications. Also suitable for typical instruction-following tasks. Balance of model accuracy and compute efficiency (the model fits on a single RTX GPU and can be used locally).
49
 
50
  ## Release Date: <br>
51
- x/xx/2025 <br>
52
 
53
  ## References
54
 
55
- - [\[2502.00203\] Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment](https://arxiv.org/abs/2502.00203)
56
 
57
 
58
  ## Model Architecture
@@ -223,16 +223,16 @@ These results contain both “Reasoning On”, and “Reasoning Off”. We recom
223
 
224
  | Reasoning Mode | Score |
225
  |--------------|------------|
226
- | Reasoning Off | 7.6 |
227
- | Reasoning On | 8.1 |
228
 
229
 
230
  ### MATH500
231
 
232
  | Reasoning Mode | pass@1 |
233
  |--------------|------------|
234
- | Reasoning Off | 72.0% |
235
- | Reasoning On | 95.1% |
236
 
237
  User Prompt Template:
238
 
@@ -246,7 +246,7 @@ User Prompt Template:
246
  | Reasoning Mode | pass@1 |
247
  |--------------|------------|
248
  | Reasoning Off | 13.3% |
249
- | Reasoning On | 46.7% |
250
 
251
  User Prompt Template:
252
 
@@ -259,8 +259,8 @@ User Prompt Template:
259
 
260
  | Reasoning Mode | pass@1 |
261
  |--------------|------------|
262
- | Reasoning Off | 31.8% |
263
- | Reasoning On | 55.8% |
264
 
265
  User Prompt Template:
266
 
@@ -274,15 +274,15 @@ User Prompt Template:
274
 
275
  | Reasoning Mode | Strict:Prompt | Strict:Instruction |
276
  |--------------|------------|------------|
277
- | Reasoning Off | 73.6% | 80.8% |
278
- | Reasoning On | 75.4% | 82.6% |
279
 
280
  ### BFCL v2 Live
281
 
282
  | Reasoning Mode | Score |
283
  |--------------|------------|
284
- | Reasoning Off | 57.1% |
285
- | Reasoning On | 64.2% |
286
 
287
  User Prompt Template:
288
 
@@ -298,8 +298,8 @@ User Prompt Template:
298
 
299
  | Reasoning Mode | pass@1 |
300
  |--------------|------------|
301
- | Reasoning Off | 66.4% |
302
- | Reasoning On | 86.0% |
303
 
304
  User Prompt Template:
305
 
 
38
 
39
  **Model Developer:** NVIDIA
40
 
41
+ **Model Dates:** Trained between August 2024 and May 2025
42
 
43
  **Data Freshness:** The pretraining data has a cutoff of 2023 per Meta Llama 3.1 8B
44
 
 
48
  Developers designing AI Agent systems, chatbots, RAG systems, and other AI-powered applications. Also suitable for typical instruction-following tasks. Balance of model accuracy and compute efficiency (the model fits on a single RTX GPU and can be used locally).
49
 
50
  ## Release Date: <br>
51
+ 5/19/2025 <br>
52
 
53
  ## References
54
 
55
+ - [\[2505.00949\] Llama-Nemotron: Efficient Reasoning Models](https://arxiv.org/abs/2505.00949)
56
 
57
 
58
  ## Model Architecture
 
223
 
224
  | Reasoning Mode | Score |
225
  |--------------|------------|
226
+ | Reasoning Off | 7.4 |
227
+ | Reasoning On | 8.0 |
228
 
229
 
230
  ### MATH500
231
 
232
  | Reasoning Mode | pass@1 |
233
  |--------------|------------|
234
+ | Reasoning Off | 71.8% |
235
+ | Reasoning On | 96.2% |
236
 
237
  User Prompt Template:
238
 
 
246
  | Reasoning Mode | pass@1 |
247
  |--------------|------------|
248
  | Reasoning Off | 13.3% |
249
+ | Reasoning On | 46.3% |
250
 
251
  User Prompt Template:
252
 
 
259
 
260
  | Reasoning Mode | pass@1 |
261
  |--------------|------------|
262
+ | Reasoning Off | 33.8% |
263
+ | Reasoning On | 55.1% |
264
 
265
  User Prompt Template:
266
 
 
274
 
275
  | Reasoning Mode | Strict:Prompt | Strict:Instruction |
276
  |--------------|------------|------------|
277
+ | Reasoning Off | 70.1% | 78.5% |
278
+ | Reasoning On | 75.5% | 82.6% |
279
 
280
  ### BFCL v2 Live
281
 
282
  | Reasoning Mode | Score |
283
  |--------------|------------|
284
+ | Reasoning Off | 63.6% |
285
+ | Reasoning On | 67.9% |
286
 
287
  User Prompt Template:
288
 
 
298
 
299
  | Reasoning Mode | pass@1 |
300
  |--------------|------------|
301
+ | Reasoning Off | 61.9% |
302
+ | Reasoning On | 85.8% |
303
 
304
  User Prompt Template:
305