Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

kastan
/
rlhf-qa-ppo

Text Generation
Transformers
PyTorch
gptj
Model card Files Files and versions
xet
Community
1
rlhf-qa-ppo
Ctrl+K
Ctrl+K
  • 1 contributor
History: 3 commits
kastan's picture
kastan
Step 3 of 3; First attempt at a PPO fine-tuned model.
4ba6577 over 2 years ago
  • pytorch_model
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • .gitattributes
    1.48 kB
    initial commit over 2 years ago
  • config.json
    1.1 kB
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • latest
    13 Bytes
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • pytorch_model.bin

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "collections.OrderedDict",
    • "torch._utils._rebuild_tensor_v2"

    What is a pickle import?

    8.83 GB
    xet
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • random_states_0.pkl

    Detected Pickle imports (7)

    • "torch.ByteStorage",
    • "numpy.core.multiarray._reconstruct",
    • "numpy.ndarray",
    • "collections.OrderedDict",
    • "numpy.dtype",
    • "torch._utils._rebuild_tensor_v2",
    • "_codecs.encode"

    How to fix it?

    16.7 kB
    xet
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • scheduler.bin

    Pickle imports

    • No problematic imports detected

    What is a pickle import?

    627 Bytes
    xet
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
  • zero_to_fp32.py
    18.9 kB
    Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago