jaredjoss/pythia-410m-roberta-lr_8e7-kl_01-steps_12000-rlhf-model Text Generation • 0.4B • Updated Aug 6, 2024 • 10
jaredjoss/pythia-410m-roberta-lr_8e7-kl_005-steps_2000-rlhf-model Text Generation • 0.4B • Updated Apr 16, 2024 • 4
jaredjoss/pythia-160m-roberta-lr_1e6-kl_0035-steps_1000-rlhf-model Text Generation • 0.2B • Updated Apr 16, 2024 • 3
jaredjoss/pythia-70m-roberta-lr_3e6-kl_0035-steps_600-rlhf-model Text Generation • 71M • Updated Apr 16, 2024 • 4
jaredjoss/pythia-160m-rlhf-pythia-70m-toxicity-model-v2 Text Generation • 0.2B • Updated Jan 30, 2024 • 5
jaredjoss/pythia-70m-toxicity-model-pythia-160m-rlhf Text Generation • 0.2B • Updated Jan 30, 2024 • 5
jaredjoss/roberta-toxicity-classifier-pythia-160m-rlhf Text Generation • 0.2B • Updated Jan 29, 2024 • 4
jaredjoss/pythia-160m-rlhf-pythia-70m-toxicity-model Text Generation • 0.2B • Updated Jan 27, 2024 • 5