Making Qwen3 Think in Korean with Reinforcement Learning https://arxiv.org/abs/2508.10355
AI & ML interests
VDPU, SLM, RAG
Recent Activity
Reasoning model distilled from DeepSeek-R1, enhanced with GRPO using supplementary reasoning datasets.
For more details, please visit https://github.com/dnotitia/smoothie-qwen
-
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs
Paper • 2507.05686 • Published • 1 -
dnotitia/Smoothie-Qwen3-0.6B
Text Generation • 0.6B • Updated • 8 • 1 -
dnotitia/Smoothie-Qwen3-1.7B
Text Generation • 2B • Updated • 6.54k • 1 -
dnotitia/Smoothie-Qwen3-4B
Text Generation • 4B • Updated • 33 • 2
High-performance LLM developed by Dnotitia Inc., incorporating cutting-edge techniques for superior reasoning tasks.
8B Korean SoTA model, which is instruction-tuned by Dnotitia Inc.
For more details, please visit https://github.com/dnotitia/smoothie-qwen
-
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs
Paper • 2507.05686 • Published • 1 -
dnotitia/Smoothie-Qwen2.5-0.5B-Instruct
Text Generation • 0.5B • Updated • 5 -
dnotitia/Smoothie-Qwen2.5-1.5B-Instruct
Text Generation • 2B • Updated • 2 • 1 -
dnotitia/Smoothie-Qwen2.5-3B-Instruct
Text Generation • 3B • Updated • 21 • 2
Making Qwen3 Think in Korean with Reinforcement Learning https://arxiv.org/abs/2508.10355
High-performance LLM developed by Dnotitia Inc., incorporating cutting-edge techniques for superior reasoning tasks.
Reasoning model distilled from DeepSeek-R1, enhanced with GRPO using supplementary reasoning datasets.
8B Korean SoTA model, which is instruction-tuned by Dnotitia Inc.
For more details, please visit https://github.com/dnotitia/smoothie-qwen
-
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs
Paper • 2507.05686 • Published • 1 -
dnotitia/Smoothie-Qwen3-0.6B
Text Generation • 0.6B • Updated • 8 • 1 -
dnotitia/Smoothie-Qwen3-1.7B
Text Generation • 2B • Updated • 6.54k • 1 -
dnotitia/Smoothie-Qwen3-4B
Text Generation • 4B • Updated • 33 • 2
For more details, please visit https://github.com/dnotitia/smoothie-qwen
-
Smoothie-Qwen: Post-Hoc Smoothing to Reduce Language Bias in Multilingual LLMs
Paper • 2507.05686 • Published • 1 -
dnotitia/Smoothie-Qwen2.5-0.5B-Instruct
Text Generation • 0.5B • Updated • 5 -
dnotitia/Smoothie-Qwen2.5-1.5B-Instruct
Text Generation • 2B • Updated • 2 • 1 -
dnotitia/Smoothie-Qwen2.5-3B-Instruct
Text Generation • 3B • Updated • 21 • 2