formatec/my_awesome_eli5_clm-model

Text Generation

generated_from_keras_callback

formatec committed on Jan 1, 2024

Commit 1627106 · 1 Parent(s): 882ed99

Training in progress epoch 2

Files changed (2)

README.md +4 -3
tf_model.h5 +1 -1

README.md CHANGED

@@ -15,9 +15,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 3.7906
-- Validation Loss: 3.7595
-- Epoch: 1
+- Train Loss: 3.7298
+- Validation Loss: 3.7510
+- Epoch: 2
 ## Model description
@@ -45,6 +45,7 @@ The following hyperparameters were used during training:
 |:----------:|:---------------:|:-----:|
 | 3.9064     | 3.7834          | 0     |
 | 3.7906     | 3.7595          | 1     |
+| 3.7298     | 3.7510          | 2     |
 ### Framework versions

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f1827b27899c68fdf4040b4fced1899f24b4ea1bc8040eabb4da7fe8512f1281
+oid sha256:340429a764e86145cbf715a5d51ff2c76abc8a4cd9eda86c96de5368e8a65496
 size 327745472