kastan
/

rlhf-qa-ppo

Text Generation

Model card Files Files and versions

40.6 GB

1 contributor

History: 2 commits

kastan's picture

Step 3 of 3; First attempt at a PPO fine-tuned model.

959dbed over 2 years ago

pytorch_model
Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
.gitattributes

1.48 kB

initial commit over 2 years ago
latest

13 Bytes

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
random_states_0.pkl

17.7 kB
xet

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
scheduler.bin
Pickle imports
- No problematic imports detected
What is a pickle import?
627 Bytes
xet

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago
zero_to_fp32.py

18.9 kB

Step 3 of 3; First attempt at a PPO fine-tuned model. over 2 years ago