HerrHruby/sft_qwen_3_1p7b_reasoning_cache_e2e_deepscaler_16k_2048_toks_560_steps 2B • Updated 1 day ago • 12
HerrHruby/sft_qwen_3_1p7b_reasoning_cache_e2e_deepscaler_16k_2048_toks_400_steps 2B • Updated 1 day ago • 11
HerrHruby/sft_qwen_3_1p7b_reasoning_cache_seq_sft_16k_1024_toks_1500_steps 2B • Updated 4 days ago • 6
HerrHruby/reasoning_cache_deepscalr_16k_sft_e2e_prompt_summaries_1024_2048_73k Updated about 7 hours ago
HerrHruby/reasoning_cache_deepscalr_16k_sft_e2e_prompt_summaries_2048_small Updated about 19 hours ago