Uploaded finetuned model
- Developed by: koutch
- License: apache-2.0
- Finetuned from model : unsloth/SmolLM3-3B
This smollm3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 23
Model tree for koutch/0.json_train_grpo_v2_dev
Base model
HuggingFaceTB/SmolLM3-3B-Base
Finetuned
HuggingFaceTB/SmolLM3-3B
Finetuned
unsloth/SmolLM3-3B
