wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step700_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Feb 11
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step350_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step50_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_no_prior_think_step100_2026-01-27_21-36-45_nvidia_balanced 8B • Updated Jan 28
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step350_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step300_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step100_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step150_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27 • 2
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step200_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-8b-codeexp_grpo_with_prior_think_step50_2026-01-27_03-19-15_nvidia_balanced 8B • Updated Jan 27
wenwenD/qwen3-4b-codeexp_grpo_no_prior_think_step280_2026-01-25_06-29-13_nvidia_balanced 4B • Updated Jan 25
wenwenD/qwen3-4b-codeexp_grpo_w_prior_think_step280_2026-01-25_06-28-54_nvidia_balanced 4B • Updated Jan 25
wenwenD/qwen3-4b-codeexp_grpo_with_prior_think_step280_2026-01-24_07-19-57_nvidia 4B • Updated Jan 24
wenwenD/qwen3-4b-codeexp_grpo_w_prior_think_discount_always1_step175_2026_01_23_21_40_33 4B • Updated Jan 24
wenwenD/qwen7B-instruct-repo_sft_3epcs_w_context-synthetic_multiturn_sft_3epcs 8B • Updated Jun 16, 2025