Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
chchen
/
Llama-3.1-8B-Instruct-ppo-250
like
0
PEFT
Safetensors
arxiv:
1910.09700
Model card
Files
Files and versions
Community
Use this model
main
Llama-3.1-8B-Instruct-ppo-250
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
chchen
Upload 14 files
4557d00
verified
30 days ago
reward
Upload 14 files
30 days ago
.gitattributes
Safe
1.57 kB
Upload 14 files
30 days ago
README.md
Safe
5.11 kB
Upload 14 files
30 days ago
adapter_config.json
733 Bytes
Upload 14 files
30 days ago
adapter_model.safetensors
83.9 MB
LFS
Upload 14 files
30 days ago
special_tokens_map.json
Safe
650 Bytes
Upload 14 files
30 days ago
tokenizer.json
Safe
17.2 MB
LFS
Upload 14 files
30 days ago
tokenizer_config.json
Safe
55.5 kB
Upload 14 files
30 days ago
trainer_log.jsonl
2.96 kB
Upload 14 files
30 days ago
trainer_state.json
2.64 kB
Upload 14 files
30 days ago
training_args.bin
pickle
Detected Pickle imports (9)
"torch.device"
,
"accelerate.state.PartialState"
,
"transformers.trainer_utils.SchedulerType"
,
"accelerate.utils.dataclasses.DistributedType"
,
"transformers.trainer_utils.HubStrategy"
,
"transformers.training_args.OptimizerNames"
,
"llamafactory.hparams.training_args.TrainingArguments"
,
"transformers.trainer_pt_utils.AcceleratorConfig"
,
"transformers.trainer_utils.IntervalStrategy"
How to fix it?
5.62 kB
LFS
Upload 14 files
30 days ago
training_loss.png
34.3 kB
Upload 14 files
30 days ago
training_reward.png
42.5 kB
Upload 14 files
30 days ago
value_head.safetensors
16.6 kB
LFS
Upload 14 files
30 days ago