kastan/rlhf-qa-ppo

Text Generation
Transformers
PyTorch
gptj
rlhf-qa-ppo
40.6 GB
  • 1 contributor
History: 2 commits
kastan
Step 3 of 3; First attempt at a PPO fine-tuned model.
959dbed over 2 years ago
  • pytorch_model
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • .gitattributes
    1.48 kB
    initial commit over 2 years ago
  • latest
    13 Bytes
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • random_states_0.pkl
    17.7 kB
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • scheduler.bin
    627 Bytes
    Pickle imports: no problematic imports detected
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • zero_to_fp32.py
    18.9 kB
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago