morizon/llm-jp-3-13b-instruct2-grpo-MATH-lighteval_step1000 Text Generation • 14B • Updated Feb 19 • 1
daichira/llm-jp-3-13b-instruct2-gpro-0222_OpenMATHinstruct_1800_sft_math-tanuki_adapter Updated Feb 24
u-10bei/llm-jp-3-13b-instruct2-grpo-MATH-lighteval_step1000 Text Generation • 14B • Updated Mar 1 • 1