Model save

Files changed (5)

README.md +85 -0
generation_config.json +16 -0
runs/Aug17_02-56-40_e2e-68-229/events.out.tfevents.1723877827.e2e-68-229 +2 -2
runs/Aug17_09-33-09_e2e-68-229/events.out.tfevents.1723901621.e2e-68-229 +3 -0
training_args.bin +1 -1

README.md ADDED

	@@ -0,0 +1,85 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: Helsinki-NLP/opus-mt-mul-en
+tags:
+- generated_from_trainer
+metrics:
+- bleu
+model-index:
+- name: marianMT_hin_eng_cs
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# marianMT_hin_eng_cs
+This model is a fine-tuned version of [Helsinki-NLP/opus-mt-mul-en](https://huggingface.co/Helsinki-NLP/opus-mt-mul-en) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Bleu: 78.0784
+- Gen Len: 74.6804
+- Loss: 0.1472
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 50
+- eval_batch_size: 50
+- seed: 42
+- distributed_type: multi-GPU
+- num_devices: 2
+- total_train_batch_size: 100
+- total_eval_batch_size: 100
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 30.0
+### Training results
+| Training Loss | Epoch | Step  | Bleu    | Gen Len | Validation Loss |
+|:-------------:|:-----:|:-----:|:-------:|:-------:|:---------------:|
+| 1.5823        | 1.0   | 1118  | 11.6257 | 77.1622 | 1.1778          |
+| 0.921         | 2.0   | 2236  | 33.2917 | 76.1459 | 0.6357          |
+| 0.6472        | 3.0   | 3354  | 47.3533 | 75.9194 | 0.4504          |
+| 0.5246        | 4.0   | 4472  | 55.2169 | 75.6871 | 0.3579          |
+| 0.4228        | 5.0   | 5590  | 60.8262 | 75.5777 | 0.3041          |
+| 0.3745        | 6.0   | 6708  | 64.8987 | 75.4424 | 0.2693          |
+| 0.3552        | 7.0   | 7826  | 67.7607 | 75.2438 | 0.2455          |
+| 0.3324        | 8.0   | 8944  | 69.635  | 75.1036 | 0.2274          |
+| 0.2912        | 9.0   | 10062 | 71.3086 | 75.0326 | 0.2117          |
+| 0.2591        | 10.0  | 11180 | 72.392  | 74.9607 | 0.2001          |
+| 0.2471        | 11.0  | 12298 | 73.4758 | 74.9251 | 0.1899          |
+| 0.236         | 12.0  | 13416 | 74.4219 | 74.833  | 0.1822          |
+| 0.2265        | 13.0  | 14534 | 75.1435 | 74.9069 | 0.1745          |
+| 0.2152        | 14.0  | 15652 | 75.7614 | 74.7409 | 0.1695          |
+| 0.2078        | 15.0  | 16770 | 76.2353 | 74.7092 | 0.1641          |
+| 0.2048        | 16.0  | 17888 | 76.7381 | 74.7274 | 0.1593          |
+| 0.1975        | 17.0  | 19006 | 76.9954 | 74.7217 | 0.1559          |
+| 0.1943        | 18.0  | 20124 | 77.421  | 74.6641 | 0.1524          |
+| 0.1987        | 19.0  | 21242 | 77.8231 | 74.6833 | 0.1495          |
+| 0.1855        | 20.0  | 22360 | 78.0784 | 74.6804 | 0.1472          |
+### Framework versions
+- Transformers 4.45.0.dev0
+- Pytorch 2.4.0+cu121
+- Datasets 2.21.0
+- Tokenizers 0.19.1
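
The hyperparameters and the results table in the model card above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up, and the learning-rate estimate assumes a plain linear decay to zero with no warmup and no gradient accumulation, neither of which the card states.

```python
# Values copied from the model card above.
train_batch_size = 50       # per-device batch size
num_devices = 2
base_lr = 5e-05
num_epochs = 30             # scheduled epochs (num_epochs: 30.0)
steps_per_epoch = 1118      # step count in the epoch 1.0 row
last_logged_step = 22360    # step count in the final table row

# Effective batch size across both GPUs; the card reports 100.
total_train_batch_size = train_batch_size * num_devices

# The table stops at epoch 20 although 30 epochs were scheduled.
logged_epochs = last_logged_step / steps_per_epoch

# Rough upper bound on training-set size (the last batch may be partial).
max_train_examples = steps_per_epoch * total_train_batch_size

# ASSUMPTION: linear decay to zero over all scheduled steps, no warmup.
total_scheduled_steps = steps_per_epoch * num_epochs
lr_at_last_logged_step = base_lr * (1 - last_logged_step / total_scheduled_steps)

print(total_train_batch_size, logged_epochs, max_train_examples)
print(f"{lr_at_last_logged_step:.3e}")
```

Under those assumptions, the last logged row sits two thirds of the way through the schedule, so the learning rate there would be about one third of its starting value.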

generation_config.json ADDED

	@@ -0,0 +1,16 @@

+{
+  "bad_words_ids": [
+    [
+      64171
+    ]
+  ],
+  "bos_token_id": 0,
+  "decoder_start_token_id": 64171,
+  "eos_token_id": 0,
+  "forced_eos_token_id": 0,
+  "max_length": 512,
+  "num_beams": 6,
+  "pad_token_id": 64171,
+  "renormalize_logits": true,
+  "transformers_version": "4.45.0.dev0"
+}
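
The token IDs in the config above are related to one another: the pad token doubles as the decoder start token and is also the only banned token, while the EOS token is forced at the end of generation. A minimal sketch that checks these relationships with the standard library only; the flag names are hypothetical, and the JSON is copied from the diff.

```python
import json

# Contents of generation_config.json as added in this commit.
generation_config = json.loads("""
{
  "bad_words_ids": [[64171]],
  "bos_token_id": 0,
  "decoder_start_token_id": 64171,
  "eos_token_id": 0,
  "forced_eos_token_id": 0,
  "max_length": 512,
  "num_beams": 6,
  "pad_token_id": 64171,
  "renormalize_logits": true,
  "transformers_version": "4.45.0.dev0"
}
""")

pad_id = generation_config["pad_token_id"]

# Decoding starts from the pad token.
starts_with_pad = generation_config["decoder_start_token_id"] == pad_id

# The pad token may never be generated as an output token.
pad_is_banned = [pad_id] in generation_config["bad_words_ids"]

# The forced final token is the EOS token.
eos_is_forced = (
    generation_config["forced_eos_token_id"] == generation_config["eos_token_id"]
)

print(starts_with_pad, pad_is_banned, eos_is_forced)
```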

runs/Aug17_02-56-40_e2e-68-229/events.out.tfevents.1723877827.e2e-68-229 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:84c6179be5913ec2e869d161035bc5978f651f426822cee995618c8834865f95
-size 486964
+oid sha256:922d6718c4cd6b97e9ba80e8de53bb4535887342703fdb67269bdce80af9e222
+size 504971

runs/Aug17_09-33-09_e2e-68-229/events.out.tfevents.1723901621.e2e-68-229 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8da84597d94fb9f3ea844f80da18a8bc7d3c0c0eb244d03e965e12fa26cbc94c
+size 6153

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:df5a342d02a223201bae3c46f56bb8e4ee4fe3ef3d10dd0587d8e728d04e2792
+oid sha256:6953cfb24240c90c1f285bbf1bef8d8d1a5a659138df0ae4ec8bd47e1851778b
 size 5368
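
The binary files in this commit appear only as Git LFS pointers: three `key value` lines giving the spec version, the content hash, and the size in bytes. A small sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of any Git LFS tooling, and the pointer text is copied from the diffs above.

```python
def parse_lfs_pointer(text):
    """Split a Git LFS pointer file into its version, hash, and size fields."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }

# Pointer for the newly added TensorBoard event file.
added_events = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:8da84597d94fb9f3ea844f80da18a8bc7d3c0c0eb244d03e965e12fa26cbc94c
size 6153
""")

# The changed event file grew from 486964 to 504971 bytes.
events_size_delta = 504971 - 486964

print(added_events["hash_algo"], added_events["size"], events_size_delta)
```

For `training_args.bin`, only the `oid` line differs between the two versions; the size stays at 5368 bytes.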