distillslm/10000_code_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 7
distillslm/5000_code_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 7
distillslm/10000_math_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 7
distillslm/5000_math_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 6
distillslm/0_code_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 6
distillslm/0_math_supervised_kd_sft_Qwen2.5-3B-Instruct_from_Qwen2.5-7B-Instruct Text Generation • 3B • Updated Feb 6