Inference Providers
Active filters: vllm
GadflyII/GLM-4.7-Flash-NVFP4
Text Generation
• Updated • 87.4k
• 69
HarleyWang/Qwen3.5-27B-Claude-Opus-4.6-High-Reasoning
Image-Text-to-Text
• 27B • Updated • 23.7k
• 10
caiovicentino1/Qwen3.5-27B-PolarQuant-Q5
Text Generation
• 27B • Updated • 1.18k
• 7
Lorbus/Qwopus3.5-27B-v3-int4-autoround
6B • Updated • 1.3k
• 3
caiovicentino1/Qwopus3.5-27B-v3-PolarQuant-v7-GPTQ
Text Generation
• 27B • Updated • 3
mistralai/Pixtral-12B-2409
Updated • 16k
• 684
stelterlab/Mistral-Small-24B-Instruct-2501-AWQ
Text Generation
• 24B • Updated • 418k
• 29
mistralai/Mistral-Small-3.1-24B-Instruct-2503
Updated • 540k
• 1.36k
mistralai/Mistral-Small-3.2-24B-Instruct-2506
Updated • 768k
• 577
mistralai/Voxtral-Mini-3B-2507
5B • Updated • 559k
• 636
mlx-community/gpt-oss-20b-MXFP4-Q8
Text Generation
• 21B • Updated • 574k
• 43
mlx-community/gpt-oss-120b-MXFP4-Q8
Text Generation
• 117B • Updated • 11.9k
• 10
mistralai/Magistral-Small-2509
24B • Updated • 18.3k
• 298
openai/gpt-oss-safeguard-20b
Text Generation
• Updated • 40.8k
• • 205
unsloth/gpt-oss-safeguard-20b-GGUF
Text Generation
• 21B • Updated • 738
• 9
mistralai/Ministral-3-3B-Base-2512
4B • Updated • 25k
• 63
mistralai/Ministral-3-3B-Reasoning-2512
4B • Updated • 13.7k
• 111
mistralai/Ministral-3-8B-Reasoning-2512
Updated • 15.7k
• 76
mistralai/Ministral-3-14B-Instruct-2512
Updated • 352k
• 272
mistralai/Ministral-3-8B-Instruct-2512
9B • Updated • 237k
• 162
kldzj/gpt-oss-120b-heretic
Text Generation
• 117B • Updated • 113k
• 15
mistralai/Devstral-2-123B-Instruct-2512
125B • Updated • 121k
• 313
unsloth/Ministral-3-14B-Reasoning-2512-GGUF
14B • Updated • 18.9k
• 44
unsloth/Ministral-3-8B-Instruct-2512-GGUF
8B • Updated • 10.8k
• 31
Text Generation
• 8B • Updated • 693
• 2
bullpoint/Qwen3-Coder-Next-AWQ-4bit
Text Generation
• 14B • Updated • 146k
• 24
MuXodious/gpt-oss-20b-RichardErkhov-heresy-GGUF
Text Generation
• 21B • Updated • 1.48k
• 3
saricles/MiniMax-M2.5-REAP-139B-A10B-NVFP4-GB10
Text Generation
• 79B • Updated • 770
• 7
p-e-w/gpt-oss-20b-heretic-ara-v3
Text Generation
• 2B • Updated • 1.3k
• 27
RedHatAI/Qwen3.5-35B-A3B-FP8-dynamic
Image-Text-to-Text
• 35B • Updated • 1.2k
• 4