GGUF llama.cpp quantized version of:
- Original model: Qwen3-4B-Instruct-2507
- Model creator: Qwen
- License
Recommended Prompt Format (chatml)
<|im_start|>system
Provide some context and/or instructions to the model.<|im_end|>
<|im_start|>user
The user’s message goes here<|im_end|>
<|im_start|>assistant
AI message goes here<|im_end|>
- Downloads last month
- 15
Hardware compatibility
Log In to add your hardware
4-bit
5-bit
8-bit
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support