qwen3.5-4b-codex-polar-step72

This is the Hugging Face conversion of the Polar SWE-Gym Slime GRPO checkpoint at iter 73 / train step 72.

Training harness: codex. Base model: Qwen/Qwen3.5-4B.
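As a minimal sketch of how the converted checkpoint might be fetched and inspected, the snippet below downloads the repository with `huggingface_hub.snapshot_download` and reads the `architectures` field from its `config.json`. The repo id is the one shown for this model on the Hub. The helper names `fetch_checkpoint` and `read_architectures` are hypothetical, and the presence of an `architectures` key in `config.json` is an assumption based on the usual Hugging Face layout, not something this card states.

```python
import json
from pathlib import Path
from typing import List

# Repo id as listed for this model on the Hub.
REPO_ID = "billxbf/qwen3.5-4b-codex-polar-step72"


def fetch_checkpoint(repo_id: str = REPO_ID) -> Path:
    """Download every file in the repo (or reuse the local cache) and return the folder."""
    # Imported lazily so the inspection helper below works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return Path(snapshot_download(repo_id=repo_id))


def read_architectures(checkpoint_dir: Path) -> List[str]:
    """Return the model class names listed in config.json, or an empty list if absent."""
    config_path = Path(checkpoint_dir) / "config.json"
    config = json.loads(config_path.read_text(encoding="utf-8"))
    # Assumption: the conversion wrote the standard "architectures" key.
    return list(config.get("architectures", []))


if __name__ == "__main__":
    local_dir = fetch_checkpoint()
    print(local_dir)
    print(read_architectures(local_dir))
```

The reported class names indicate which `transformers` loader is likely to fit the checkpoint. Checking them first avoids guessing a loader class that this card does not specify.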

Safetensors: model size 5B params, tensor type BF16.

Model tree for billxbf/qwen3.5-4b-codex-polar-step72: fine-tuned from Qwen/Qwen3.5-4B.