Edit Models filters

Model Tree

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

287

Base only

Active filters: cais/mmlu

tensorblock/nontoxic-bagel-34b-v0.2-GGUF

34B • Updated Jan 27 • 28

kasinadhsarma/vishwamai-model

Text Generation • Updated Feb 17, 2025

rtt4fb/LlamaCode-Codeforces-v1

Text Generation • Updated Apr 24, 2025 • 1

Biomed-imaging-lab/NeuroRAG

Question Answering • Updated Jun 11, 2025 • 2

CarlOwOs/distilled-qwen3-0.6b-qlora-mmlu

Updated Jun 2, 2025 • 2

CarlOwOs/distilled-qwen3-0.6b-full-mmlu

Text Generation • 0.8B • Updated Jun 2, 2025

tklohj/windyfllm2.2

Updated Jul 2, 2025

tklohj/windyllm_2.3

Question Answering • Updated Jul 3, 2025

ChangyuLiu/DeepSeek-R1-Distill-Qwen-1.5B-GPTQ_W8A8_G128

2B • Updated Jul 27, 2025

ChangyuLiu/DeepSeek-R1-Distill-Qwen-1.5B-GPTQ_FP8_DYNAMIC_G128

2B • Updated Jul 28, 2025 • 3

ChangyuLiu/DeepSeek-R1-Distill-Qwen-7B-GPTQ_W8A8_G128

8B • Updated Jul 28, 2025

ChangyuLiu/DeepSeek-R1-Distill-Llama-8B-GPTQ_W8A8_G128

8B • Updated Jul 28, 2025 • 1

Irfanuruchi/Llama-2-13B-Computer-Engineering

Text Generation • 13B • Updated Sep 24, 2025 • 1

sdobson/nanochat

Text Generation • Updated Oct 15, 2025 • 29

HarleyCooper/nanochat561

Text Generation • 0.6B • Updated Oct 23, 2025 • 32 • 6

sampathchanda/nanochat-d20

Text Generation • Updated Oct 14, 2025

loocorez/nanochat-base-d20-step21400

Updated Oct 15, 2025 • 5

loocorez/nanochat-mid-d20-step765

Updated Oct 15, 2025 • 3

loocorez/nanochat-sft-d20-step650

Updated Oct 15, 2025 • 1

appvoid/distilled-qwen3-0.6b-full-mmlu-Q8_0-GGUF

0.8B • Updated Oct 19, 2025 • 3

db5kb/financial-advice-llm-Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Dec 3, 2025 • 9

SlimFactoryHub/SlimMoE-250M-base

Text Generation • 0.3B • Updated Dec 31, 2025 • 2

SlimFactoryHub/SlimMoE-250M-SFT-v1

Text Generation • 0.3B • Updated Dec 31, 2025 • 2

SlimFactoryHub/SlimMoE-250M-SFT-v2

Text Generation • 0.3B • Updated Dec 31, 2025 • 8

Abigail45/Chyio

Text Generation • Updated Dec 11, 2025

SlimFactoryHub/SlimMoE-250M-instruct

Text Generation • 0.3B • Updated Dec 31, 2025 • 4

AImhotep/GLM-4.7-REAP-265B-mixed-AutoRound

Text Generation • 2B • Updated Feb 6 • 85 • 2

dystrio/Qwen3.5-9B-Sculpt-Default

Text Generation • 9B • Updated Mar 23 • 5

dystrio/Qwen3.5-9B-Sculpt-Production

Text Generation • 9B • Updated Mar 23 • 4

dystrio/Qwen3.5-9B-Sculpt-Throughput

Text Generation • 8B • Updated Mar 23 • 5 • 3