sarraj19 committed on
Commit 209ebc8 · verified · 1 Parent(s): b841a9e

End of training

README.md CHANGED
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [facebook/mbart-large-50-many-to-many-mmt](https://huggingface.co/facebook/mbart-large-50-many-to-many-mmt) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.8520
+ - Loss: 0.4294
  - Rouge1: 0.0
  - Rouge2: 0.0
  - Rougel: 0.0
  - Rougelsum: 0.0
- - Gen Len: 6.2727
+ - Gen Len: 6.0519
 
  ## Model description
 
@@ -52,15 +52,15 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
- | No log | 1.0 | 39 | 0.8280 | 0.0 | 0.0 | 0.0 | 0.0 | 6.5714 |
- | No log | 2.0 | 78 | 0.7957 | 0.0 | 0.0 | 0.0 | 0.0 | 6.5844 |
- | No log | 3.0 | 117 | 0.8424 | 0.0 | 0.0 | 0.0 | 0.0 | 6.3896 |
- | No log | 4.0 | 156 | 0.8520 | 0.0 | 0.0 | 0.0 | 0.0 | 6.2727 |
+ | No log | 1.0 | 39 | 0.4279 | 0.0 | 0.0 | 0.0 | 0.0 | 5.8442 |
+ | No log | 2.0 | 78 | 0.4427 | 0.0 | 0.0 | 0.0 | 0.0 | 5.7532 |
+ | No log | 3.0 | 117 | 0.4261 | 0.0 | 0.0 | 0.0 | 0.0 | 5.7273 |
+ | No log | 4.0 | 156 | 0.4294 | 0.0 | 0.0 | 0.0 | 0.0 | 6.0519 |
 
 
  ### Framework versions
 
- - Transformers 4.38.2
+ - Transformers 4.40.1
  - Pytorch 2.2.1+cu121
- - Datasets 2.18.0
- - Tokenizers 0.15.2
+ - Datasets 2.19.0
+ - Tokenizers 0.19.1
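Note on the README.md change: the card reports the final-epoch loss (0.4294), while the updated results table shows a slightly lower validation loss at epoch 3 (0.4261). The following is a minimal sketch of that comparison, assuming the table rows are transcribed by hand into a hypothetical `rows` list; it is not taken from the training script.

```python
# Rows transcribed from the updated training results table in README.md.
# Each tuple is (epoch, step, validation_loss, gen_len); `rows` is a
# hypothetical name used only for this illustration.
rows = [
    (1.0, 39, 0.4279, 5.8442),
    (2.0, 78, 0.4427, 5.7532),
    (3.0, 117, 0.4261, 5.7273),
    (4.0, 156, 0.4294, 6.0519),
]


def best_by_validation_loss(rows):
    """Return the row with the smallest validation loss."""
    return min(rows, key=lambda r: r[2])


best = best_by_validation_loss(rows)
final = rows[-1]  # the epoch whose metrics the card reports

print(f"best epoch:  {best[0]} (validation loss {best[2]})")
print(f"final epoch: {final[0]} (validation loss {final[2]})")
```

With `Trainer`, the last epoch's weights are kept unless `load_best_model_at_end=True` is set in `TrainingArguments`; whether this run set it is not stated in the diff.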
config.json CHANGED
@@ -52,7 +52,7 @@
  "static_position_embeddings": false,
  "tokenizer_class": "MBart50Tokenizer",
  "torch_dtype": "float32",
- "transformers_version": "4.38.2",
+ "transformers_version": "4.40.1",
  "use_cache": true,
  "vocab_size": 250054
  }
generation_config.json CHANGED
@@ -7,5 +7,5 @@
  "max_length": 200,
  "num_beams": 5,
  "pad_token_id": 1,
- "transformers_version": "4.38.2"
+ "transformers_version": "4.40.1"
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:811ec3b44a52ee22d67a875ad875864ca75657a37ac7b6b852dbd2282a7be17f
+ oid sha256:baac0c73774a0e0f220b1aea8f278d51440acc61e3139dcef4c2a8595ce3737f
  size 2444578688
runs/May06_02-26-41_930f38d6e5d3/events.out.tfevents.1714962404.930f38d6e5d3.161.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:c20c924accc6cf2309728dd2494e26edfcea229fa2e7c404d6efe84ee176a5b0
+ size 8006
tokenizer.json CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:73397494b7dbed83d0bdc990eaf128cf2397ff728c768a1d4225695383f28b62
- size 17110040
+ oid sha256:a76af549f03c34dfcfc9731ad073ab1e3acd0d2caa15c95b1192d67f33015c64
+ size 17110018
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:58f005f6bfeb098aa6411a0b802863db4ca71f884292451bbd0416f2ad507b0e
- size 5048
+ oid sha256:b649eebc8b4e868edec53d5c4bba5ae78ff134991f0cd8ce904124cabb35dbb6
+ size 5112
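
Note on the binary files: the hunks for model.safetensors, tokenizer.json, training_args.bin, and the events file show Git LFS pointer files (three `key value` lines: `version`, `oid`, `size`), not the file contents themselves. The following is a minimal sketch of reading such a pointer and comparing two revisions. The helper name `parse_lfs_pointer` is hypothetical, and the sample text is copied from the training_args.bin hunk.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Old and new pointer contents from the training_args.bin hunk.
old = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:58f005f6bfeb098aa6411a0b802863db4ca71f884292451bbd0416f2ad507b0e
size 5048
""")
new = parse_lfs_pointer("""version https://git-lfs.github.com/spec/v1
oid sha256:b649eebc8b4e868edec53d5c4bba5ae78ff134991f0cd8ce904124cabb35dbb6
size 5112
""")

content_changed = old["digest"] != new["digest"]
size_delta = new["size"] - old["size"]
print(content_changed, size_delta)
```

A changed `oid` with an unchanged `size`, as in the model.safetensors hunk, indicates that the stored bytes differ even though the file length is the same.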