gotnogpu commited on
Commit
0d3bbaf
·
verified ·
1 Parent(s): aca75c9

End of training

Browse files
Files changed (1) hide show
  1. README.md +8 -6
README.md CHANGED
@@ -1,7 +1,8 @@
1
  ---
2
- library_name: transformers
3
  tags:
4
  - generated_from_trainer
 
5
  model-index:
6
  - name: distilgpt2
7
  results: []
@@ -14,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model was trained from scratch on an unknown dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 3.3276
18
 
19
  ## Model description
20
 
@@ -45,14 +46,15 @@ The following hyperparameters were used during training:
45
 
46
  | Training Loss | Epoch | Step | Validation Loss |
47
  |:-------------:|:-----:|:----:|:---------------:|
48
- | No log | 1.0 | 476 | 3.3778 |
49
- | 3.5344 | 2.0 | 952 | 3.3372 |
50
- | 3.3754 | 3.0 | 1428 | 3.3276 |
51
 
52
 
53
  ### Framework versions
54
 
 
55
  - Transformers 4.51.3
56
  - Pytorch 2.6.0+cu124
57
  - Datasets 3.5.1
58
- - Tokenizers 0.21.1
 
1
  ---
2
+ library_name: peft
3
  tags:
4
  - generated_from_trainer
5
+ base_model: distilgpt2
6
  model-index:
7
  - name: distilgpt2
8
  results: []
 
15
 
16
  This model was trained from scratch on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 3.3275
19
 
20
  ## Model description
21
 
 
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | No log | 1.0 | 476 | 3.3277 |
50
+ | 3.3233 | 2.0 | 952 | 3.3276 |
51
+ | 3.3253 | 3.0 | 1428 | 3.3275 |
52
 
53
 
54
  ### Framework versions
55
 
56
+ - PEFT 0.15.2
57
  - Transformers 4.51.3
58
  - Pytorch 2.6.0+cu124
59
  - Datasets 3.5.1
60
+ - Tokenizers 0.21.1