AdversarialRLHF/ppo_pythia410m_tldr6.9b_rm410mdata_mergedsft_prefix_nokl
Organization: Adversarial Goodhart RLHF
Tags: Safetensors, gpt_neox
Commit History (branch: main)
Training in progress, step 256
295194e (verified), Muqeeth committed on Apr 30

Training in progress, step 208, checkpoint
10eb061 (verified), Muqeeth committed on Apr 30

Training in progress, step 208
806ffbf (verified), Muqeeth committed on Apr 30

Training in progress, step 156, checkpoint
42b6da6 (verified), Muqeeth committed on Apr 30

Training in progress, step 156
081b165 (verified), Muqeeth committed on Apr 30

Training in progress, step 104, checkpoint
eb98dfa (verified), Muqeeth committed on Apr 30

Training in progress, step 104
d158d98 (verified), Muqeeth committed on Apr 30

Training in progress, step 52, checkpoint
d67980b (verified), Muqeeth committed on Apr 30

Training in progress, step 52
3173c4f (verified), Muqeeth committed on Apr 30

initial commit
6fe8672 (verified), Muqeeth committed on Apr 30