This model is an int4 model with group_size 128 and symmetric quantization of Qwen/Qwen2-0.5B-Instruct generated by intel/auto-round algorithm.

Mainly for vllm ut

Downloads last month
1,836
Safetensors
Model size
184M params
Tensor type
I32
BF16
F16
Inference Providers NEW
This model isn't deployed by any Inference Provider. 馃檵 Ask for provider support