Uploaded model
- Developed by: ntkhoi
- License: apache-2.0
- Finetuned from model : ntkhoi/Qwen3-4B-Medical-SFT-0728
This qwen3 model was trained 2x faster with Unsloth and Huggingface's TRL library.
- Downloads last month
- 5
Model tree for ntkhoi/Qwen3-4B-Medical-DPO-0803
Base model
Qwen/Qwen3-4B-Base
Finetuned
unsloth/Qwen3-4B-Base
Finetuned
ntkhoi/Qwen3-4B-Medical-CPT-0707
Finetuned
ntkhoi/Qwen3-4B-Medical-SFT-0728
