Siheng99/Qwen2.5-Math-1.5B-DeepMath-1024samples-GRPO • Text Generation • 2B • Updated Jun 6, 2025 • 4
Siheng99/Qwen2.5-Math-1.5B-DeepMath-1024samples-RePO • Text Generation • 2B • Updated Jun 6, 2025 • 4
Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-GRPO • Text Generation • 8B • Updated Jun 6, 2025 • 6
Siheng99/Qwen2.5-Math-7B-DeepMath-1024samples-RePO • Text Generation • 8B • Updated Jun 6, 2025 • 4