Granite Quantized Models Collection Quantized versions of IBM Granite models. Licensed under the Apache 2.0 license. • 44 items • Updated Nov 21, 2025 • 29
DistilGPT-OSS-qwen3-4B Collection Qwen3 4b trained on GPT-OSS outputs • 2 items • Updated 7 days ago • 2
Gemma 3 QAT Collection Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 15 items • Updated Jul 10, 2025 • 212
MoshiVis v0.1 Collection MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 9 items • Updated 15 days ago • 23
Reasoning Datasets Collection Distilled synthetic Reasoning datasets • 7 items • Updated Feb 2, 2025 • 61
Instruct Models - Better instruction following. Collection Q6s Instruct models. Other Qs in model listing. Instruct models are better at one shot / api type usage in most cases vs "chat". Listed: old to new. • 90 items • Updated 30 days ago • 5
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 8 days ago • 550
200+ Roleplay, Creative Writing, Uncensored, NSFW models. Collection Oldest models listed first, with Newest models at bottom of the page. Most repos have full examples, instructions, best settings and so on. • 353 items • Updated 1 day ago • 423
Long Context - 16k,32k,64k,128k,200k,256k,512k,1000k Collection Listed oldest to newest. Some with up to 1 million context. • 61 items • Updated 30 days ago • 20
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper • 2402.13753 • Published Feb 21, 2024 • 116