akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet_14BDrafter 2B • Updated Jun 16 • 3
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet 2B • Updated Jun 16 • 2
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet_32BDrafter Updated Jun 13
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner_Mini Text Generation • 2B • Updated Jun 11 • 6
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SplitReasoner Text Generation • 2B • Updated Apr 22 • 1
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-SpeculativeReasoner Text Generation • 2B • Updated Apr 19 • 1 • 1
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpeculativeReasoner Text Generation • 2B • Updated Apr 17 • 5
akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-SpecReasoner_SFT_GRPO_14k_v3 Text Generation • 2B • Updated Apr 15 • 1