view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 17 days ago • 93
view article Article Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models 20 days ago • 104
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 76
Cerebras REAP Collection Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 19 items • Updated 16 days ago • 77
Granite Quantized Models Collection Quantized versions of IBM Granite models. Licensed under the Apache 2.0 license. • 44 items • Updated Nov 21, 2025 • 29
On Path to Multimodal Generalist: General-Level and General-Bench Paper • 2505.04620 • Published May 7, 2025 • 82
Qwen2.5-Omni Collection End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 4 days ago • 160
Josiefied and Abliterated Qwen3 Collection Abliterated, and further fine-tuned to be the most uncensored models available. • 17 items • Updated 19 days ago • 31