Models: 7,212

Active filters: awq

mratsim/MiniMax-M2.1-FP8-INT4-AWQ

Text Generation • 39B • Updated 4 days ago • 590 • 9

QuantTrio/GLM-4.7-AWQ

Text Generation • 358B • Updated 13 days ago • 18.6k • 18

QuantTrio/MiniMax-M2.1-AWQ

Text Generation • 229B • Updated 11 days ago • 5.49k • 8

hugging-quants/Meta-Llama-3.1-8B-Instruct-AWQ-INT4

Text Generation • 8B • Updated Aug 7, 2024 • 147k • 84

gaunernst/gemma-3-4b-it-int4-awq

Image-Text-to-Text • Updated Apr 6, 2025 • 39.1k • 5

stelterlab/DeepSeek-R1-0528-Qwen3-8B-AWQ

Text Generation • 8B • Updated Jun 4, 2025 • 5.76k • 4

twhitworth/gpt-oss-120b-awq-w4a16

117B • Updated Aug 19, 2025 • 2.46k • 19

solidrust/Llama-3-8B-Lexi-Uncensored-AWQ

Text Generation • 8B • Updated Sep 3, 2024 • 155k • 5

casperhansen/mistral-large-instruct-2407-awq

Text Generation • 123B • Updated Jul 25, 2024 • 173 • 5

Qwen/Qwen2.5-7B-Instruct-AWQ

Text Generation • 8B • Updated Oct 9, 2024 • 263k • 35

kosbu/Llama-3.3-70B-Instruct-AWQ

Text Generation • 71B • Updated Dec 7, 2024 • 338k • 7

casperhansen/deepseek-r1-distill-qwen-14b-awq

15B • Updated Feb 8, 2025 • 7.47k • 14

inarikami/DeepSeek-R1-Distill-Qwen-32B-AWQ

Text Generation • 33B • Updated Jan 23, 2025 • 4.7k • 11

Qwen/Qwen2.5-VL-72B-Instruct-AWQ

Image-Text-to-Text • 74B • Updated Mar 7, 2025 • 167k • 71

Qwen/Qwen3-32B-AWQ

Text Generation • 33B • Updated May 21, 2025 • 86.6k • 119

Qwen/Qwen3-8B-AWQ

Text Generation • 8B • Updated May 21, 2025 • 112k • 32

Eslzzyl/Qwen3-4B-Instruct-2507-AWQ

Text Generation • 4B • Updated Aug 12, 2025 • 4.13k • 1

QuantTrio/MiniMax-M2-AWQ

Text Generation • 229B • Updated Dec 3, 2025 • 373k • 9

QuantTrio/MiniMax-M2-REAP-162B-A10B-AWQ

Text Generation • 162B • Updated 6 days ago • 580 • 3

QuantTrio/DeepSeek-V3.2-AWQ

Text Generation • 685B • Updated Dec 3, 2025 • 2.94k • 9

nn-tech/MetalGPT-1-AWQ

Text Generation • 33B • Updated 15 days ago • 451 • 5

cybermotaz/nemotron3-nano-nvfp4-w4a16

Text Generation • 18B • Updated 23 days ago • 14k • 8

CultriX/Nevoria-R1-70b-AWQ-W4A16-g128

Text Generation • 11B • Updated 6 days ago • 230 • 1

TheHouseOfTheDude/GLM-4.7_Compressed-Tensors

Text Generation • Updated 14 days ago • 11 • 4

casperhansen/mpt-7b-8k-chat-awq

Text Generation • Updated Nov 4, 2023 • 24 • 3

casperhansen/falcon-7b-awq

Text Generation • Updated Nov 4, 2023 • 22 • 1

casperhansen/vicuna-7b-v1.5-awq

Text Generation • Updated Oct 31, 2023 • 25 • 3

casperhansen/vicuna-7b-v1.5-awq-gemv

Text Generation • Updated Oct 31, 2023 • 18 • 1

casperhansen/mpt-7b-8k-chat-awq-gemv

Text Generation • Updated Oct 31, 2023 • 17

casperhansen/opt-125m-awq

Text Generation • 0.2B • Updated Oct 31, 2023 • 93 • 3