devhem commited on
Commit
970e056
·
verified ·
1 Parent(s): 44a2e58

Model save

Browse files
Files changed (2) hide show
  1. README.md +7 -7
  2. model.safetensors +1 -1
README.md CHANGED
@@ -18,8 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 2.2234
22
- - Accuracy: 0.5384
23
 
24
  ## Model description
25
 
@@ -45,7 +45,7 @@ The following hyperparameters were used during training:
45
  - gradient_accumulation_steps: 4
46
  - total_train_batch_size: 64
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
- - lr_scheduler_type: reduce_lr_on_plateau
49
  - num_epochs: 4
50
  - mixed_precision_training: Native AMP
51
 
@@ -53,10 +53,10 @@ The following hyperparameters were used during training:
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
55
  |:-------------:|:------:|:----:|:---------------:|:--------:|
56
- | 6.6014 | 1.0 | 543 | 1.4088 | 0.5653 |
57
- | 4.6014 | 2.0 | 1086 | 1.3753 | 0.5688 |
58
- | 3.069 | 3.0 | 1629 | 1.6580 | 0.5522 |
59
- | 1.4178 | 3.9931 | 2168 | 2.2234 | 0.5384 |
60
 
61
 
62
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) on an unknown dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 1.7181
22
+ - Accuracy: 0.5725
23
 
24
  ## Model description
25
 
 
45
  - gradient_accumulation_steps: 4
46
  - total_train_batch_size: 64
47
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
48
+ - lr_scheduler_type: linear
49
  - num_epochs: 4
50
  - mixed_precision_training: Native AMP
51
 
 
53
 
54
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
55
  |:-------------:|:------:|:----:|:---------------:|:--------:|
56
+ | 6.7571 | 1.0 | 645 | 1.3419 | 0.5689 |
57
+ | 4.9223 | 2.0 | 1290 | 1.2715 | 0.5910 |
58
+ | 3.9903 | 3.0 | 1935 | 1.4913 | 0.5767 |
59
+ | 1.32 | 3.9946 | 2576 | 1.7181 | 0.5725 |
60
 
61
 
62
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d72a7f881abba2ea534f88a6a1a31105eca852f154b38d1d747a7aac4b5157ed
3
  size 1583458240
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:07f876cb7217b4910fb1da7a7f294f3dedd8b4ae8416ef02f2c8965cbb3dd9be
3
  size 1583458240