CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-all-4k-with_reasoning Text Generation • 1B • Updated Aug 21 • 3
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_with_reasoning_fixed_DSAI 8B • Updated Aug 19 • 3
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_without_reasoning_fixed_DSAI 8B • Updated Aug 19 • 7
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_256_minibs_16_microbs_16_n_16 2B • Updated Aug 18 • 3
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_512_minibs_16_microbs_16_n_32 2B • Updated Aug 18 • 3
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-all-4k-without_reasoning Text Generation • 1B • Updated Aug 17 • 2
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_without_reasoning_fixed_DSAI Feature Extraction • 8B • Updated Aug 17 • 5
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_without_reasoning_DSAI Feature Extraction • 8B • Updated Aug 16 • 1
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_fixed_DSAI Feature Extraction • 8B • Updated Aug 16 • 5
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_DSAI Feature Extraction • 8B • Updated Aug 16 • 1
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_128_minibs_32_microbs_32_n_4 2B • Updated Aug 16 • 1
CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_128_minibs_16_microbs_16_n_8 2B • Updated Aug 16 • 1
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-math-4k-with_reasoning Text Generation • 1B • Updated Aug 16 • 4
CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-math-4k-without_reasoning Text Generation • 1B • Updated Aug 15 • 1
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.01.4_orchard Text Generation • 4B • Updated Aug 11 • 7
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.01.2_orchard Text Generation • 4B • Updated Aug 11 • 7
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.00.4_orchard Text Generation • 4B • Updated Aug 11 • 3
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.01.1_orchard Text Generation • 4B • Updated Aug 11 • 7
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.00.2_orchard Text Generation • 4B • Updated Aug 10 • 4
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.01.00.1_orchard Text Generation • 4B • Updated Aug 10 • 4
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.00.01_orchard Text Generation • 4B • Updated Aug 10 • 4
CohenQu/sft_llama3_3b-finemath-4plus.02.02-35000_numina-cot-100k.00.00_orchard Text Generation • 4B • Updated Aug 10 • 7
CohenQu/sft_llama3_3b-finemath-4plus-flexible-ordering.02.04_long-35000_numina-cot-100k_orchard Text Generation • 4B • Updated Jul 24 • 3