kastan/rlhf-qa-ppo

Text Generation

1 contributor

History: 3 commits

Step 3 of 3; First attempt at a PPO fine-tuned model.

4ba6577 over 2 years ago

pytorch_model
Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
.gitattributes

1.48 kB

initial commit over 2 years ago
config.json

1.1 kB

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
latest

13 Bytes

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
pytorch_model.bin
Detected Pickle imports (3)
- "torch.FloatStorage",
- "collections.OrderedDict",
- "torch._utils._rebuild_tensor_v2"
8.83 GB
xet

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
random_states_0.pkl
Detected Pickle imports (7)
- "torch.ByteStorage",
- "numpy.core.multiarray._reconstruct",
- "numpy.ndarray",
- "collections.OrderedDict",
- "numpy.dtype",
- "torch._utils._rebuild_tensor_v2",
- "_codecs.encode"
16.7 kB
xet

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
scheduler.bin
Pickle imports
- No problematic imports detected
627 Bytes
xet

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
zero_to_fp32.py

18.9 kB

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
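
The "Detected Pickle imports" entries in the listing above name the modules and callables a pickle stream would import when loaded. As a rough illustration of how such a list can be derived without unpickling anything, the sketch below walks the opcode stream with the standard-library `pickletools` module. The helper name `list_pickle_imports` is hypothetical, and this is not necessarily how the hosting site's scanner works. It also assumes raw pickle bytes: newer PyTorch checkpoints are typically zip archives, so the inner pickle file would have to be extracted first.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return 'module.name' strings referenced by a pickle stream.

    Hypothetical helper: it inspects opcodes only and never executes
    the pickle, so it is safe to run on untrusted bytes.
    """
    imports = set()
    recent_strings = []  # string arguments seen so far, in order
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name were pushed as two strings.
            # Simplification: assume they are the two most recent strings.
            if len(recent_strings) >= 2:
                imports.add(f"{recent_strings[-2]}.{recent_strings[-1]}")
        elif isinstance(arg, str):
            recent_strings.append(arg)
    return imports


# Small self-made payload standing in for a real checkpoint file.
payload = pickle.dumps(collections.OrderedDict(a=1), protocol=4)
found = list_pickle_imports(payload)
print(sorted(found))
```

The `STACK_GLOBAL` branch is a heuristic: a stream that fetches the module or name from the memo instead of pushing fresh strings would be missed or misreported, so a production scanner would need to model the stack and memo more carefully.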