philschmid/dpo-llama-3-1-8b-math