Edit Models filters

Apps

Inference Providers

HF Inference API

Misc

compressed-tensors

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

3,003

Full-text search

Active filters: compressed-tensors

allenai/olmOCR-2-7B-1025-FP8

Image-to-Text • 8B • Updated 7 days ago • 15.6k • 112

cerebras/GLM-4.6-REAP-218B-A32B-FP8

Text Generation • 218B • Updated 6 days ago • 413 • 33

zai-org/GLM-4.6-FP8

Text Generation • 358B • Updated 14 days ago • 572k • • 74

cerebras/GLM-4.6-REAP-268B-A32B-FP8

Text Generation • 269B • Updated 6 days ago • 202 • 4

MidnightPhreaker/GLM-4.5-Air-REAP-82B-A12B-GPTQ-INT4-gs32

14B • Updated 7 days ago • 196 • 3

cerebras/GLM-4.6-REAP-252B-A32B-FP8

Text Generation • 252B • Updated 6 days ago • 80 • 3

meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8

Image-Text-to-Text • 402B • Updated May 22 • 200k • • 139

RedHatAI/gemma-3-27b-it-quantized.w4a16

Image-Text-to-Text • 7B • Updated Jun 9 • 50.7k • 9

zai-org/GLM-4.5-FP8

Text Generation • 358B • Updated Aug 12 • 9.29k • 75

zai-org/GLM-4.5-Air-FP8

Text Generation • 111B • Updated Aug 12 • 101k • • 67

inclusionAI/Ring-mini-linear-2.0-GPTQ-int4

Text Generation • 3B • Updated 7 days ago • 423 • 8

inclusionAI/Ring-flash-linear-2.0-GPTQ-int4

Text Generation • 15B • Updated 7 days ago • 180 • 8

cerebras/GLM-4.5-Air-REAP-82B-A12B-FP8

Text Generation • 82B • Updated 3 days ago • 93 • 2

RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic

Text Generation • 8B • Updated Sep 22 • 117k • 7

AtlaAI/Selene-1-Llama-3.3-70B-GPTQ-W8A8

Text Generation • 71B • Updated Jul 25 • 29 • 2

RedHatAI/Mistral-Small-3.1-24B-Instruct-2503-quantized.w4a16

Image-Text-to-Text • 5B • Updated about 5 hours ago • 179k • 9

jeffcookio/Mistral-Small-3.2-24B-Instruct-2506-awq-sym

5B • Updated Jul 4 • 4.05k • 8

RedHatAI/Kimi-K2-Instruct-quantized.w4a16

Text Generation • 146B • Updated 16 days ago • 246 • 11

cpatonn/Qwen3-4B-Thinking-2507-AWQ-4bit

Text Generation • 1B • Updated Aug 6 • 439 • 1

zai-org/GLM-4.5V-FP8

Image-Text-to-Text • 108B • Updated 4 days ago • 486k • • 34

cpatonn/GLM-4.5V-AWQ-4bit

Image-Text-to-Text • 19B • Updated Sep 2 • 983 • 3

allenai/olmOCR-7B-0825-FP8

Image-to-Text • 8B • Updated 7 days ago • 139k • 9

NousResearch/Hermes-4-70B-FP8

Text Generation • 71B • Updated Sep 12 • 521 • 24

scb10x/typhoon2.1-gemma3-12b-fp8

Text Generation • 13B • Updated Aug 31 • 184 • 1

NousResearch/Hermes-4-14B-FP8

Text Generation • 15B • Updated Sep 3 • 1.88k • 11

cpatonn/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit

Text Generation • Updated Sep 24 • 228k • 39

cpatonn/Qwen3-Next-80B-A3B-Thinking-AWQ-4bit

Text Generation • Updated Sep 20 • 29.9k • 14

cpatonn/Tongyi-DeepResearch-30B-A3B-AWQ-8bit

9B • Updated Sep 18 • 54 • 3

dphn/Dolphin-X1-8B-FP8

Text Generation • 8B • Updated 15 days ago • 1.09k • 1

cpatonn/Qwen3-Next-80B-A3B-Thinking-AWQ-8bit

Text Generation • Updated Sep 23 • 333 • 1