-
-
-
-
-
-
Inference Providers
Active filters:
awq
QuantTrio/Qwen3-VL-30B-A3B-Instruct-AWQ
Text Generation
•
31B
•
Updated
•
105k
•
27
QuantTrio/MiniMax-M2-AWQ
Text Generation
•
229B
•
Updated
•
2.99k
•
3
Qwen/Qwen3-32B-AWQ
Text Generation
•
6B
•
Updated
•
462k
•
113
Qwen/Qwen1.5-72B-Chat-AWQ
Text Generation
•
12B
•
Updated
•
1.26k
•
25
nateraw/defog-sqlcoder-70b-alpha-awq
Text Generation
•
10B
•
Updated
•
5
•
2
hugging-quants/Meta-Llama-3.1-405B-Instruct-AWQ-INT4
Text Generation
•
59B
•
Updated
•
643
•
36
Qwen/Qwen2.5-3B-Instruct-AWQ
Text Generation
•
1.0B
•
Updated
•
83.7k
•
13
Qwen/Qwen2.5-7B-Instruct-AWQ
Text Generation
•
2B
•
Updated
•
566k
•
31
Qwen/Qwen2.5-14B-Instruct-AWQ
Text Generation
•
3B
•
Updated
•
97.5k
•
26
AMead10/Llama-3.2-3B-Instruct-AWQ
Text Generation
•
1B
•
Updated
•
724
•
3
gaunernst/gemma-3-27b-it-int4-awq
Image-Text-to-Text
•
6B
•
Updated
•
19.2k
•
33
Qwen/Qwen2.5-VL-32B-Instruct-AWQ
Image-Text-to-Text
•
6B
•
Updated
•
80.6k
•
60
Qwen/Qwen3-8B-AWQ
Text Generation
•
2B
•
Updated
•
227k
•
27
Qwen/Qwen2.5-Omni-7B-AWQ
Any-to-Any
•
5B
•
Updated
•
17.4k
•
12
ReadyArt/Mistral-Small-3.1-DRAFT-0.5B-AWQ
Text Generation
•
0.6B
•
Updated
•
17
•
1
openbmb/MiniCPM-V-4_5-AWQ
Image-Text-to-Text
•
3B
•
Updated
•
5.39k
•
10
QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ
Text Generation
•
Updated
•
1.51k
•
3
yapwithai/orpheus-3b-trt-int4-awq
Text-to-Speech
•
Updated
•
2
yapwithai/impish-12b-awq
Text Generation
•
12B
•
Updated
•
64
•
1
QuantTrio/Qwen3-VL-32B-Instruct-AWQ
Image-Text-to-Text
•
33B
•
Updated
•
2.11k
•
3
ModelCloud/Marin-32B-Base-GPTQMODEL-AWQ-W4A16
Text Generation
•
33B
•
Updated
•
26
•
1
casperhansen/mpt-7b-8k-chat-awq
Text Generation
•
Updated
•
5
•
3
casperhansen/falcon-7b-awq
Text Generation
•
Updated
•
1
casperhansen/vicuna-7b-v1.5-awq
Text Generation
•
Updated
•
11
•
3
casperhansen/vicuna-7b-v1.5-awq-gemv
Text Generation
•
Updated
•
1
•
1
casperhansen/mpt-7b-8k-chat-awq-gemv
Text Generation
•
Updated
•
1
casperhansen/opt-125m-awq
Text Generation
•
90.3M
•
Updated
•
25
•
3
casperhansen/tinyllama-1b-awq
Text Generation
•
Updated
•
3.34k
Bomml/Llama-2-70B-chat-w4-g128-awq
Text Generation
•
Updated
TheBloke/Llama-2-7B-Chat-AWQ
Text Generation
•
1B
•
Updated
•
3.8k
•
24