Active filters: gptq
lk11/opt-125m-gptq-4bit • Text Generation • 0.1B • Updated • 3
aryamannningombam/gemma-GPTQ_g128-3bits • Text Generation • 2B • Updated • 3
AminBH/Llama-3-8B-GPTQ-wiki-8bit • Text Generation • 3B • Updated • 4
Xu-Ouyang/pythia-14m-int2-step36000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
Xu-Ouyang/pythia-14m-int2-step71000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 3
Xu-Ouyang/pythia-14m-int2-step107000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
Xu-Ouyang/pythia-14m-int2-step110000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 3
Xu-Ouyang/pythia-14m-int2-step143000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 2
Xu-Ouyang/pythia-14m-int3-step36000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
Xu-Ouyang/pythia-14m-int3-step71000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
Xu-Ouyang/pythia-14m-int3-step107000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 2
Xu-Ouyang/pythia-14m-int3-step110000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 3
Xu-Ouyang/pythia-14m-int3-step143000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 3
Xu-Ouyang/pythia-14m-int4-step36000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
Xu-Ouyang/pythia-14m-int4-step71000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 3
Xu-Ouyang/pythia-14m-int4-step107000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 3
Xu-Ouyang/pythia-14m-int4-step110000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
Xu-Ouyang/pythia-14m-int4-step143000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
Xu-Ouyang/pythia-14m-int8-step36000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 3
Xu-Ouyang/pythia-14m-int8-step71000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
AminBH/Llama-3-8B-GPTQ-wiki-4bit • Text Generation • 2B • Updated • 4
Xu-Ouyang/pythia-14m-int8-step107000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 3
Xu-Ouyang/pythia-14m-int8-step110000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 2
Xu-Ouyang/pythia-14m-int8-step143000-GPTQ-wikitext2 • Text Generation • 0.0B • Updated • 4
tedindie/semikong-8b-awq • Text Generation • 2B • Updated • 3
infly/INF-34B-Chat-GPTQ-4bit • Text Generation • 6B • Updated • 3 • 2
infly/INF-34B-Chat-GPTQ-8bit • Text Generation • 10B • Updated • 3
Nkumah7/gemma-1.1-2b-it-4bit-gptq • Text Generation • 0.8B • Updated • 4
Xu-Ouyang/pythia-1.4b-deduped-int4-step14000-GPTQ-wikitext2 • Text Generation • 0.4B • Updated • 3
Xu-Ouyang/pythia-1.4b-deduped-int4-step29000-GPTQ-wikitext2 • Text Generation • 0.4B • Updated • 4