gotnogpu/distilgpt2

Generated from Trainer

gotnogpu committed on May 5

Commit 0d3bbaf · verified · 1 parent: aca75c9

End of training

Files changed (1)

README.md +8 -6

@@ -1,7 +1,8 @@
 ---
-library_name: transformers
+library_name: peft
 tags:
 - generated_from_trainer
+base_model: distilgpt2
 model-index:
 - name: distilgpt2
   results: []
@@ -14,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.3276
+- Loss: 3.3275
 ## Model description
@@ -45,14 +46,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 476  | 3.3778          |
-| 3.5344        | 2.0   | 952  | 3.3372          |
-| 3.3754        | 3.0   | 1428 | 3.3276          |
+| No log        | 1.0   | 476  | 3.3277          |
+| 3.3233        | 2.0   | 952  | 3.3276          |
+| 3.3253        | 3.0   | 1428 | 3.3275          |
 ### Framework versions
+- PEFT 0.15.2
 - Transformers 4.51.3
 - Pytorch 2.6.0+cu124
 - Datasets 3.5.1
-- Tokenizers 0.21.1
+- Tokenizers 0.21.1
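
The final evaluation losses in the two versions of the README can also be compared as perplexities. The following is a minimal sketch, assuming the reported loss is a mean per-token cross-entropy in nats (the card does not state this); the helper name `perplexity` is hypothetical and not part of the commit.

```python
import math


def perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


# Final validation losses copied from the two versions of the README.
old_final = 3.3276  # before this commit
new_final = 3.3275  # after this commit

for label, loss in (("before", old_final), ("after", new_final)):
    print(f"{label}: loss={loss:.4f} perplexity={perplexity(loss):.2f}")
```

Under that assumption, the final loss differs by only 0.0001 between the two versions, so the corresponding perplexities are nearly identical.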