Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

8,466

Full-text search

Active filters: chat

huihui-ai/QwQ-32B-abliterated

Text Generation • 33B • Updated Mar 12, 2025 • 245 • • 105

microsoft/bitnet-b1.58-2B-4T-gguf

Text Generation • 2B • Updated Dec 17, 2025 • 17.1k • 238

bartowski/Goekdeniz-Guelmez_Josiefied-Qwen3-8B-abliterated-v1-GGUF

Text Generation • Updated May 5, 2025 • 3.84k • 32

mradermacher/Huihui-Qwen3-4B-abliterated-v2-GGUF

4B • Updated Jul 31, 2025 • 2.03k • 6

trillionlabs/Tri-21B

Text Generation • 21B • Updated 10 days ago • 2.96k • 46

DavidAU/Qwen3-Zero-Coder-Reasoning-V2-0.8B-NEO-EX-GGUF

Text Generation • 0.8B • Updated Jul 28, 2025 • 3.19k • 19

NousResearch/Hermes-4-14B

Text Generation • 425k • Updated Jan 9 • 3.46k • 119

ethanolivertroy/HackIDLE-NIST-Coder-MLX-4bit

Text Generation • 1B • Updated Oct 14, 2025 • 17 • 2

ValiantLabs/Ministral-3-14B-Reasoning-2512-Esper3.1

Text Generation • 14B • Updated Dec 4, 2025 • 15 • 6

mradermacher/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-i1-GGUF

80B • Updated Dec 23, 2025 • 3.88k • 5

MuXodious/Luna-7B-A4B-absolute-heresy

Text Generation • 7B • Updated 11 days ago • 27 • 4

MuXodious/Luna-7B-A4B-PaperWitch-heresy

Text Generation • 7B • Updated 10 days ago • 67 • 3

mradermacher/Luna-7B-A4B-PaperWitch-heresy-GGUF

7B • Updated 11 days ago • 790 • 2

mradermacher/Luna-7B-A4B-PaperWitch-heresy-i1-GGUF

7B • Updated 11 days ago • 6.68k • 2

mradermacher/Qwen2.5-14B-Instruct-Heretic-i1-GGUF

15B • Updated 4 days ago • 10.9k • 2

Qwen/Qwen1.5-1.8B-Chat-GGUF

Text Generation • 2B • Updated Apr 9, 2024 • 3.27k • 21

sail/Sailor-7B-Chat

Text Generation • 8B • Updated Dec 21, 2024 • 72 • 8

sail/Sailor-1.8B-Chat

Text Generation • 2B • Updated Dec 21, 2024 • 60 • • 6

Qwen/CodeQwen1.5-7B-Chat

Text Generation • 7B • Updated Apr 30, 2024 • 4.25k • 352

wdndev/tiny_llm_sft_92m

Text Generation • 92.1M • Updated May 27, 2024 • 266 • 10

RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a8

Text Generation • 8B • Updated Sep 22, 2025 • 18.2k • 20

Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated Jan 12, 2025 • 505k • 520

anthracite-org/magnum-v2-12b-exl2

Text Generation • Updated Aug 14, 2024 • 3 • 4

NousResearch/Hermes-3-Llama-3.1-405B

Text Generation • Updated Oct 8, 2024 • 145 • 260

bartowski/Hermes-3-Llama-3.1-405B-GGUF

Text Generation • 406B • Updated Sep 27, 2024 • 1.11k • 14

alpindale/Mistral-Large-Instruct-2407-FP8

Text Generation • 123B • Updated Sep 12, 2024 • 25 • 10

bartowski/Qwen2.5-7B-Instruct-GGUF

Text Generation • 8B • Updated Sep 19, 2024 • 46.6k • 46

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int4

Text Generation • 15B • Updated Oct 9, 2024 • 93.1k • 26

Qwen/Qwen2.5-0.5B-Instruct-GGUF

Text Generation • 0.6B • Updated Sep 20, 2024 • 56.4k • 75

Qwen/Qwen2.5-1.5B-Instruct-GGUF

Text Generation • 2B • Updated Sep 20, 2024 • 109k • 83