Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

67

Base only

Active filters: cerebras

barozp/Qwen3.6-28B-REAP20-A3B-GGUF

Text Generation • 28B • Updated Apr 19 • 14.1k • 28

mradermacher/gemma-4-19b-a4b-it-REAP-i1-GGUF

18B • Updated 3 days ago • 11.7k • 19

0xSero/Gemma-4-19B

Text Generation • 19B • Updated 6 days ago • 435 • 19

mradermacher/Qwen3-Coder-64B-GGUF

64B • Updated 4 days ago • 869 • 2

mradermacher/Gemma-4-21B-i1-GGUF

21B • Updated 4 days ago • 4.13k • 2

0xSero/Gemma-4-21B

Text Generation • 21B • Updated 6 days ago • 1.16k • 97

mradermacher/Gemma-4-19B-GGUF

18B • Updated 5 days ago • 919 • 1

mradermacher/Gemma-4-21B-GGUF

21B • Updated 5 days ago • 855 • 1

mradermacher/Gemma-4-19B-i1-GGUF

18B • Updated 5 days ago • 3.91k • 1

mradermacher/Qwen3-Coder-64B-i1-GGUF

64B • Updated 4 days ago • 2.1k • 1

SebastianSchramm/Cerebras-GPT-111M-instruction

Text Generation • 0.1B • Updated Nov 28, 2023 • 21 • 3

cerebras/Llama3-DocChat-1.0-8B

Text Generation • Updated Aug 16, 2024 • 13 • • 69

NikolayKozloff/Llama3-DocChat-1.0-8B-Q8_0-GGUF

Text Generation • 8B • Updated Aug 21, 2024 • 14 • 6

mattritchey/Llama3-DocChat-1.0-8B-IQ4_NL-GGUF

Text Generation • 8B • Updated Aug 22, 2024 • 21

mattritchey/Llama3-DocChat-1.0-8B-Q4_K_M-GGUF

Text Generation • 8B • Updated Aug 22, 2024 • 9

QuantFactory/Llama3-DocChat-1.0-8B-GGUF

Text Generation • 8B • Updated Aug 24, 2024 • 383 • 1

bartowski/Llama3-DocChat-1.0-8B-GGUF

Text Generation • 8B • Updated Aug 30, 2024 • 915

mradermacher/Llama3-DocChat-1.0-8B-GGUF

8B • Updated Jan 22, 2025 • 131 • 1

mradermacher/Llama3-DocChat-1.0-8B-i1-GGUF

8B • Updated Jan 22, 2025 • 355 • 1

cerebras/Llama-3-CBHybridL-8B

Text Generation • 8B • Updated Mar 26, 2025 • 8

MatteoKhan/Cerebras-OPT-Fusion

Text Generation • 7B • Updated Apr 10, 2025 • 3

cerebras/Llama-3-CBHybridM-8B

Text Generation • 8B • Updated Mar 26, 2025 • 9

mradermacher/Cerebras-OPT-Fusion-GGUF

7B • Updated Mar 5, 2025 • 133

mradermacher/Cerebras-OPT-Fusion-i1-GGUF

7B • Updated Mar 5, 2025 • 284

mradermacher/Cerebras-GPT-111M-instruction-GGUF

0.1B • Updated Jul 11, 2025 • 108

mradermacher/Cerebras-GPT-111M-instruction-i1-GGUF

0.1B • Updated Jul 11, 2025 • 214 • 1

0xSero/GLM-4.6-218B-W4A16

Text Generation • 2B • Updated 6 days ago • 28 • 8

0xSero/GLM-4.7-REAP-40-W4A16

Text Generation • 2B • Updated 6 days ago • 98 • 7

0xSero/GLM-4.7-185B

Text Generation • 185B • Updated 6 days ago • 98 • 19

0xSero/GLM-4.7-185B-W4A16

Text Generation • 2B • Updated 6 days ago • 172 • 69