bartowski/mistralai_Devstral-Small-2-24B-Instruct-2512-GGUF Text Generation • 24B • Updated 24 days ago • 39.6k • 42
view article Article Tensor Parallelism (TP) in Transformers: 5 Minutes to Understand 29 days ago • 63
The Bestiary Collection Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated Nov 16, 2025 • 75