Insights from the Inverse: Reconstructing LLM Training Goals Through Inverse RL • Paper • 2410.12491 • Published Oct 16, 2024
jaredjoss/pythia-410m-roberta-lr_8e7-kl_01-steps_12000-rlhf-model • Text Generation • 0.4B • Updated Aug 6, 2024