VerlTool/acecoder-fsdp_agent-xiaomimimo_mimo-7b-base-grpo-n16-b128-t1.0-lr1e-6-69k-2turn-sys4-110-step 8B • Updated May 11 • 1
VerlTool/acecoder-fsdp_agent-xiaomimimo_mimo-7b-base-grpo-n16-b128-t1.0-lr1e-6-69k-2turn-sys4-120-step 8B • Updated May 16 • 4
VerlTool/acecoder-fsdp_agent-mimo-7b-base-grpo-n16-b128-t1.0-lr1e-6-69k-mtrl-sys9-new2-debug-120-step 8B • Updated May 16 • 1