chchen/Llama-3.1-8B-Instruct-ppo-250

PEFT
Safetensors
  • 1 contributor
History: 2 commits
chchen
Upload 14 files
4557d00 verified 30 days ago
  • reward
    Upload 14 files 30 days ago
  • .gitattributes
    1.57 kB
    Upload 14 files 30 days ago
  • README.md
    5.11 kB
    Upload 14 files 30 days ago
  • adapter_config.json
    733 Bytes
    Upload 14 files 30 days ago
  • adapter_model.safetensors
    83.9 MB
    LFS
    Upload 14 files 30 days ago
  • special_tokens_map.json
    650 Bytes
    Upload 14 files 30 days ago
  • tokenizer.json
    17.2 MB
    LFS
    Upload 14 files 30 days ago
  • tokenizer_config.json
    55.5 kB
    Upload 14 files 30 days ago
  • trainer_log.jsonl
    2.96 kB
    Upload 14 files 30 days ago
  • trainer_state.json
    2.64 kB
    Upload 14 files 30 days ago
  • training_args.bin

    Detected Pickle imports (9)

    • "torch.device",
    • "accelerate.state.PartialState",
    • "transformers.trainer_utils.SchedulerType",
    • "accelerate.utils.dataclasses.DistributedType",
    • "transformers.trainer_utils.HubStrategy",
    • "transformers.training_args.OptimizerNames",
    • "llamafactory.hparams.training_args.TrainingArguments",
    • "transformers.trainer_pt_utils.AcceleratorConfig",
    • "transformers.trainer_utils.IntervalStrategy"

    5.62 kB
    LFS
    Upload 14 files 30 days ago
  • training_loss.png
    34.3 kB
    Upload 14 files 30 days ago
  • training_reward.png
    42.5 kB
    Upload 14 files 30 days ago
  • value_head.safetensors
    16.6 kB
    LFS
    Upload 14 files 30 days ago
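
The "Detected Pickle imports" entry under training_args.bin lists the globals that the pickle would import when loaded. A minimal sketch of how such a list can be gathered without unpickling anything, using only the standard-library `pickletools` module, is shown below. The helper name `list_pickle_imports` is hypothetical, and this is not the Hub's own scanner. It assumes a raw pickle byte stream; files written by recent `torch.save` versions are typically zip archives, so the embedded pickle member would have to be extracted first. Strings fetched back from the pickle memo are not resolved, so the result can be incomplete for larger pickles.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> list[str]:
    """Return 'module.name' for each global referenced in a pickle stream.

    The stream is only disassembled, never executed.
    """
    found = []
    strings = []  # string operands seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the operand is "module name" separated by a space.
            module, name = arg.split(" ", 1)
            found.append(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are the two strings pushed just before.
            # Simplification: strings restored from the memo are not tracked.
            if len(strings) >= 2:
                found.append(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return sorted(set(found))


if __name__ == "__main__":
    payload = pickle.dumps(OrderedDict(), protocol=4)
    for name in list_pickle_imports(payload):
        print(name)
```

Comparing the reported names against an allow-list before calling `pickle.load` or `torch.load` is one way to decide whether a file such as training_args.bin is safe to open.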