End of training

Files changed (8)

README.md +85 -0
config.json +29 -0
generation_config.json +7 -0
model.safetensors +3 -0
runs/Feb03_08-00-24_1994991c6162/events.out.tfevents.1738569625.1994991c6162.2199.0 +3 -0
runs/Feb03_08-06-51_1994991c6162/events.out.tfevents.1738570012.1994991c6162.4917.0 +3 -0
runs/Feb03_08-16-22_1994991c6162/events.out.tfevents.1738570583.1994991c6162.7971.0 +3 -0
training_args.bin +3 -0

README.md ADDED

	@@ -0,0 +1,85 @@

+---
+library_name: transformers
+license: bsd-3-clause
+base_model: weathon/smiles_llava
+tags:
+- generated_from_trainer
+metrics:
+- accuracy
+model-index:
+- name: smiles_llava_ft
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# smiles_llava_ft
+This model is a fine-tuned version of [weathon/smiles_llava](https://huggingface.co/weathon/smiles_llava) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.0768
+- Accuracy: 0.7191
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-06
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 16
+- optimizer: OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999), epsilon=1e-08, and no additional optimizer arguments
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_ratio: 0.05
+- num_epochs: 20
+- mixed_precision_training: Native AMP
+- label_smoothing_factor: 0.1
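The hyperparameters above determine an effective batch size and a learning-rate curve. The following is a minimal sketch in plain Python of linear warmup followed by half-cosine decay, the shape that `lr_scheduler_type: cosine` with a warmup ratio usually denotes. `TOTAL_STEPS = 2000` is an assumption taken from the last logged step, since the commit does not state the run's real step count. The function name `lr_at` is hypothetical, and a single device is assumed.

```python
import math

# Values copied from the hyperparameter list
LEARNING_RATE = 5e-06
TRAIN_BATCH_SIZE = 8
GRAD_ACCUM_STEPS = 2
WARMUP_RATIO = 0.05

# Assumption: the last logged step; the true total is not given in the card
TOTAL_STEPS = 2000


def lr_at(step: int, total_steps: int = TOTAL_STEPS) -> float:
    """Learning rate after `step` optimizer updates (linear warmup, then cosine decay)."""
    warmup_steps = math.ceil(total_steps * WARMUP_RATIO)
    if step < warmup_steps:
        # Linear ramp from 0 up to the peak learning rate
        return LEARNING_RATE * step / max(1, warmup_steps)
    # Fraction of the decay phase completed, in [0, 1]
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return LEARNING_RATE * 0.5 * (1.0 + math.cos(math.pi * progress))


# Per-device batch size times accumulation steps (single device assumed)
effective_batch_size = TRAIN_BATCH_SIZE * GRAD_ACCUM_STEPS

print("effective batch size:", effective_batch_size)
for step in (0, 50, 100, 1050, 2000):
    print(f"step {step:>4}: lr = {lr_at(step):.2e}")
```

Under these assumptions the effective batch size matches the `total_train_batch_size: 16` entry, and the learning rate peaks at the end of warmup before decaying toward zero.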
+### Training results
+| Training Loss | Epoch   | Step | Validation Loss | Accuracy |
+|:-------------:|:-------:|:----:|:---------------:|:--------:|
+| 3.3041        | 0.9569  | 100  | 3.5557          | 0.0      |
+| 2.3241        | 1.9091  | 200  | 2.5052          | 0.1835   |
+| 2.029         | 2.8612  | 300  | 2.2936          | 0.5056   |
+| 1.9409        | 3.8134  | 400  | 2.2173          | 0.5693   |
+| 1.9861        | 4.7656  | 500  | 2.1782          | 0.6030   |
+| 1.9564        | 5.7177  | 600  | 2.1461          | 0.6217   |
+| 1.9314        | 6.6699  | 700  | 2.1301          | 0.6704   |
+| 1.8838        | 7.6220  | 800  | 2.1084          | 0.6854   |
+| 1.9538        | 8.5742  | 900  | 2.1052          | 0.7154   |
+| 1.8382        | 9.5263  | 1000 | 2.0955          | 0.7191   |
+| 1.9399        | 10.4785 | 1100 | 2.1008          | 0.6554   |
+| 1.8231        | 11.4306 | 1200 | 2.0939          | 0.6891   |
+| 1.8172        | 12.3828 | 1300 | 2.0899          | 0.6929   |
+| 1.8708        | 13.3349 | 1400 | 2.0800          | 0.7491   |
+| 1.915         | 14.2871 | 1500 | 2.0776          | 0.7116   |
+| 1.8387        | 15.2392 | 1600 | 2.0819          | 0.7041   |
+| 1.8646        | 16.1914 | 1700 | 2.0771          | 0.7228   |
+| 1.7943        | 17.1435 | 1800 | 2.0770          | 0.7041   |
+| 1.8878        | 18.0957 | 1900 | 2.0768          | 0.7154   |
+| 1.841         | 19.0478 | 2000 | 2.0768          | 0.7191   |
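To read the table above more easily, here is a small sketch that picks out the evaluation step with the highest accuracy and the one with the lowest validation loss. The rows are copied from the table. The variable names are illustrative only, and nothing here reflects how the Trainer itself selects checkpoints.

```python
# (step, validation_loss, accuracy), copied from the training results table
ROWS = [
    (100, 3.5557, 0.0), (200, 2.5052, 0.1835), (300, 2.2936, 0.5056),
    (400, 2.2173, 0.5693), (500, 2.1782, 0.6030), (600, 2.1461, 0.6217),
    (700, 2.1301, 0.6704), (800, 2.1084, 0.6854), (900, 2.1052, 0.7154),
    (1000, 2.0955, 0.7191), (1100, 2.1008, 0.6554), (1200, 2.0939, 0.6891),
    (1300, 2.0899, 0.6929), (1400, 2.0800, 0.7491), (1500, 2.0776, 0.7116),
    (1600, 2.0819, 0.7041), (1700, 2.0771, 0.7228), (1800, 2.0770, 0.7041),
    (1900, 2.0768, 0.7154), (2000, 2.0768, 0.7191),
]

best_by_accuracy = max(ROWS, key=lambda row: row[2])
best_by_loss = min(ROWS, key=lambda row: row[1])  # first row wins on ties
final = ROWS[-1]

print("best accuracy:", best_by_accuracy)
print("lowest validation loss:", best_by_loss)
print("final evaluation:", final)
```

The peak accuracy in the table (0.7491 at step 1400) is higher than the final value of 0.7191 reported at the top of the card. The commit does not say whether a checkpoint from that step was kept.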
+### Framework versions
+- Transformers 4.48.2
+- PyTorch 2.5.1+cu124
+- Datasets 3.2.0
+- Tokenizers 0.21.0

config.json ADDED

	@@ -0,0 +1,29 @@

+{
+  "_name_or_path": "weathon/smiles_llava",
+  "architectures": [
+    "BlipForConditionalGeneration"
+  ],
+  "image_text_hidden_size": 256,
+  "initializer_factor": 1.0,
+  "initializer_range": 0.02,
+  "label_smoothing": 0.0,
+  "logit_scale_init_value": 2.6592,
+  "model_type": "blip",
+  "projection_dim": 512,
+  "text_config": {
+    "_attn_implementation_autoset": true,
+    "initializer_factor": 1.0,
+    "model_type": "blip_text_model",
+    "num_attention_heads": 12
+  },
+  "torch_dtype": "float32",
+  "transformers_version": "4.48.2",
+  "vision_config": {
+    "_attn_implementation_autoset": true,
+    "dropout": 0.0,
+    "initializer_factor": 1.0,
+    "initializer_range": 0.02,
+    "model_type": "blip_vision_model",
+    "num_channels": 3
+  }
+}
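The config above shows that, despite "llava" in the repository name, the checkpoint uses the BLIP architecture. As a minimal sketch, the relevant fields can be read with the standard `json` module. The embedded text is an abridged copy of the config shown in this diff, and the `summary` dictionary is only an illustration.

```python
import json

# Abridged copy of the config.json added in this commit
CONFIG_TEXT = """
{
  "_name_or_path": "weathon/smiles_llava",
  "architectures": ["BlipForConditionalGeneration"],
  "label_smoothing": 0.0,
  "model_type": "blip",
  "text_config": {"model_type": "blip_text_model", "num_attention_heads": 12},
  "torch_dtype": "float32",
  "vision_config": {"model_type": "blip_vision_model", "num_channels": 3}
}
"""

config = json.loads(CONFIG_TEXT)

# Collect the fields that identify the architecture and its two sub-models
summary = {
    "architecture": config["architectures"][0],
    "model_type": config["model_type"],
    "dtype": config["torch_dtype"],
    "text_model": config["text_config"]["model_type"],
    "vision_model": config["vision_config"]["model_type"],
    "image_channels": config["vision_config"]["num_channels"],
}

for key, value in summary.items():
    print(f"{key}: {value}")
```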

generation_config.json ADDED

	@@ -0,0 +1,7 @@

+{
+  "_from_model_config": true,
+  "bos_token_id": 30522,
+  "eos_token_id": 2,
+  "pad_token_id": 0,
+  "transformers_version": "4.48.2"
+}

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4181d2bce4d54fb1c67c0eb62cc788d3c45bf19394dd42d6f45f6dbe36fb58a9
+size 989717056
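The three lines above are a Git LFS pointer, not the weights themselves: they record the spec version, the SHA-256 of the real file, and its size in bytes. The sketch below parses such a pointer and checks a downloaded file against it. The helper names `parse_lfs_pointer` and `file_matches_pointer` are hypothetical. The parameter estimate assumes 4 bytes per value, in line with `"torch_dtype": "float32"` in the config, and ignores the safetensors header.

```python
import hashlib
import os

# Pointer text copied from the diff above
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:4181d2bce4d54fb1c67c0eb62cc788d3c45bf19394dd42d6f45f6dbe36fb58a9
size 989717056
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a pointer file into a small dictionary."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    algorithm, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and SHA-256 named in the pointer."""
    sha = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in 1 MiB chunks so large weight files are not loaded at once
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            sha.update(chunk)
    return os.path.getsize(path) == pointer["size"] and sha.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER_TEXT)

# Upper bound on the float32 parameter count (4 bytes each, header ignored)
max_float32_params = pointer["size"] // 4

print(pointer)
print("at most", max_float32_params, "float32 parameters")
```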

runs/Feb03_08-00-24_1994991c6162/events.out.tfevents.1738569625.1994991c6162.2199.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cd2bea07837b97940f53258430807936addc959cb8a9a284b90745df5c32a25c
+size 28248

runs/Feb03_08-06-51_1994991c6162/events.out.tfevents.1738570012.1994991c6162.4917.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:62612ce2d130c0fbeb1ba4099769c086f618299af860fa1f92f84835c7bb3a76
+size 116336

runs/Feb03_08-16-22_1994991c6162/events.out.tfevents.1738570583.1994991c6162.7971.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:445d3af242eb6ff915609bff3111591e3b1c1b2527ab51ffcf34094d902d78fc
+size 213913

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8d0ac6db1d231f2acddfa6eac71e9167b6cbc1b35979fc7c710d41872cc2089c
+size 5304