Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

compressed-tensors

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

3,108

Full-text search

Active filters: compressed-tensors

warshanks/Jan-nano-128k-AWQ

Text Generation • 1B • Updated Jul 14 • 15

Yi30/Llama-3.2-1B-Instruct-float8_kv

1B • Updated Jul 14 • 3

ramblingpolymath/Qwen3-32B-W8A8

Text Generation • 33B • Updated Aug 2 • 9

daslab-testing/kimi-k2-instruct-gptq-128g-4bit-experts-only

Updated Jul 15 • 3 • 1

context-labs/wynd-vidcap-12b

12B • Updated Jul 14

apolloparty/Devstral-Small-2507-NVFP4A16

14B • Updated Jul 18 • 98 • 1

ramblingpolymath/Qwen3-14B-W8A8

Text Generation • 15B • Updated Aug 3 • 1

ramblingpolymath/Qwen3-8B-W8A8

Text Generation • 8B • Updated Aug 3 • 6

ramblingpolymath/Qwen3-4B-W8A8

Text Generation • 4B • Updated Aug 2 • 7

Yi30/Llama-3.2-1B-Instruct-FP8-KV-llm

1B • Updated Jul 15 • 3

NeoChen1024/gemma-3n-E4B-it-FP8_DYNAMIC

Image-Text-to-Text • 8B • Updated Sep 2 • 1

vagrawal1992/Mistral-7B-Instruct-v0.2-W8A8-INT8

7B • Updated Jul 15 • 3

nm-testing/Qwen3-30B-A3B-NVFP4-0715

17B • Updated Jul 15 • 3

lucck/deepseek-awq-int6

34B • Updated Jul 15 • 1

RedHatAI/Kimi-K2-Instruct-quantized.w4a16

Text Generation • 146B • Updated 25 days ago • 244 • 11

krickwix/Meta-Llama-3-8B-Instruct-W8A8-Dynamic

8B • Updated Jul 15 • 3

ramblingpolymath/qwen3-30B-A3B-w8a8

Text Generation • 31B • Updated Aug 2 • 41

ramblingpolymath/Qwen3-0.6B-W8A8

Text Generation • 0.8B • Updated Aug 3 • 41

joedonino/unsloth_qwen25vl7b_product_descriptionv1_fp8

Image-to-Text • 8B • Updated Jul 16 • 3

weiweiz1/DeepSeek-R1-NVFP4-autoround

Updated Aug 4 • 7

krickwix/Llama-3.1-70B-Instruct-W8A8-Dynamic-Per-Token

71B • Updated Jul 16 • 4

cuongpp/gemma-3-12b-it-GPTQ-4bit

Image-Text-to-Text • 3B • Updated Jul 16 • 71

krickwix/Qwen3-30B-A3B-FP8-Dynamic

31B • Updated Jul 16 • 3

Ba2han/Gemma3-TR-DatasetCreator-w8a8

Image-Text-to-Text • 5B • Updated Jul 16 • 3

nm-testing/Qwen3-0.6B-FP8-BLOCK

0.6B • Updated Jul 16 • 1

weiweiz1/DeepSeek-V2-Lite-NVFP4-autoround

9B • Updated Jul 23 • 1

Ba2han/Gemma3-TR-DatasetCreatorv3-test2

Image-Text-to-Text • 4B • Updated Jul 17 • 3

wangqia0309/Cydonia-24B-v2-FP8-KV

24B • Updated Jul 17 • 1.76k

VAmblardPEReN/mistralai_Mistral-Small-3.2-24B-Instruct-2506-GPTQ

4B • Updated Jul 17 • 43

joedonino/unsloth_qwen25vl7b_product_descriptionv2_fp8

Image-to-Text • 8B • Updated Jul 17