Qwen3-0.6B-W8A8 / recipe.yaml
ramblingpolymath's picture
Upload folder using huggingface_hub
9f6ef3f verified
raw
history blame contribute delete
172 Bytes
default_stage:
default_modifiers:
SmoothQuantModifier: {smoothing_strength: 0.8}
GPTQModifier:
targets: [Linear]
ignore: [lm_head]
scheme: W8A8