Inference Providers
Active filters: cerebras
barozp/Qwen3.6-28B-REAP20-A3B-GGUF
Text Generation
• 28B • Updated • 14.1k
• 28
mradermacher/gemma-4-19b-a4b-it-REAP-i1-GGUF
18B • Updated • 11.7k
• 19
Text Generation
• 19B • Updated • 435
• 19
mradermacher/Qwen3-Coder-64B-GGUF
64B • Updated • 869
• 2
mradermacher/Gemma-4-21B-i1-GGUF
21B • Updated • 4.13k
• 2
Text Generation
• 21B • Updated • 1.16k
• 97
mradermacher/Gemma-4-19B-GGUF
18B • Updated • 919
• 1
mradermacher/Gemma-4-21B-GGUF
21B • Updated • 855
• 1
mradermacher/Gemma-4-19B-i1-GGUF
18B • Updated • 3.91k
• 1
mradermacher/Qwen3-Coder-64B-i1-GGUF
64B • Updated • 2.1k
• 1
SebastianSchramm/Cerebras-GPT-111M-instruction
Text Generation
• 0.1B • Updated • 21
• 3
cerebras/Llama3-DocChat-1.0-8B
Text Generation
• Updated • 13
• • 69
NikolayKozloff/Llama3-DocChat-1.0-8B-Q8_0-GGUF
Text Generation
• 8B • Updated • 14
• 6
mattritchey/Llama3-DocChat-1.0-8B-IQ4_NL-GGUF
Text Generation
• 8B • Updated • 21
mattritchey/Llama3-DocChat-1.0-8B-Q4_K_M-GGUF
Text Generation
• 8B • Updated • 9
QuantFactory/Llama3-DocChat-1.0-8B-GGUF
Text Generation
• 8B • Updated • 383
• 1
bartowski/Llama3-DocChat-1.0-8B-GGUF
Text Generation
• 8B • Updated • 915
mradermacher/Llama3-DocChat-1.0-8B-GGUF
8B • Updated • 131
• 1
mradermacher/Llama3-DocChat-1.0-8B-i1-GGUF
8B • Updated • 355
• 1
cerebras/Llama-3-CBHybridL-8B
Text Generation
• 8B • Updated • 8
MatteoKhan/Cerebras-OPT-Fusion
Text Generation
• 7B • Updated • 3
cerebras/Llama-3-CBHybridM-8B
Text Generation
• 8B • Updated • 9
mradermacher/Cerebras-OPT-Fusion-GGUF
7B • Updated • 133
mradermacher/Cerebras-OPT-Fusion-i1-GGUF
7B • Updated • 284
mradermacher/Cerebras-GPT-111M-instruction-GGUF
0.1B • Updated • 108
mradermacher/Cerebras-GPT-111M-instruction-i1-GGUF
0.1B • Updated • 214
• 1
0xSero/GLM-4.6-218B-W4A16
Text Generation
• 2B • Updated • 28
• 8
0xSero/GLM-4.7-REAP-40-W4A16
Text Generation
• 2B • Updated • 98
• 7
Text Generation
• 185B • Updated • 98
• 19
0xSero/GLM-4.7-185B-W4A16
Text Generation
• 2B • Updated • 172
• 69