Systematic SFT for Qwen3-4B. We explore diverse dataset compositions and training recipes to benchmark and improve performance across tasks.
AI & ML interests
Pioneering the Next Era of AI with Vector Intelligence
Models (35)
dnotitia/Qwen3-0.6B-Base
Text Generation • 0.6B • Updated • 7
dnotitia/Qwen3-0.6B
Text Generation • 0.8B • Updated • 5
dnotitia/Qwen3-1.7B-Base
Text Generation • 2B • Updated • 2
dnotitia/Qwen3-1.7B
Text Generation • 2B • Updated • 4
dnotitia/Qwen3-4B-Base
Text Generation • 4B • Updated • 231
dnotitia/Qwen3-4B
Text Generation • 4B • Updated • 318
dnotitia/Qwen3-4B-Instruct-2507
Text Generation • 4B • Updated • 387
dnotitia/Qwen3-4B-Thinking-2507
Text Generation • 4B • Updated • 100
dnotitia/DNA-2.1-14B
Text Generation • 15B • Updated • 4 • 1
dnotitia/DNA-2.0-14B
Text Generation • 15B • Updated • 35 • 11