Edit Models filters

Inference Providers

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

68

Full-text search

Active filters: compression

prompterminal/nanogpt-shakespeare-compressed

Text Generation • Updated Jul 21 • 15

prompterminal/nanogpt-enwik8-compressed

Text Generation • Updated Jul 21 • 25

prompterminal/nanogpt-enwik8-compressed-working

Text Generation • Updated Jul 21 • 45 • 1

haichaozhang/VQ-Token-llava-ov-0.5b

Video-Text-to-Text • 1B • Updated 12 days ago • 2 • 1

kyne0127/Qwen3-30B-A3B-TopK4-Compressed

Text Generation • 31B • Updated 12 days ago • 11

mradermacher/Qwen3-30B-A3B-TopK4-Compressed-GGUF

31B • Updated 5 days ago • 787

mradermacher/Qwen3-30B-A3B-TopK4-Compressed-i1-GGUF

31B • Updated 6 days ago • 1.05k

ggunio/B2NL-v6.1.2

Updated 7 days ago • 24