Qwen/Qwen3-235B-A22B-Instruct-2507 Text Generation β’ 235B β’ Updated Sep 17, 2025 β’ 96.6k β’ β’ 747
RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8-dynamic Text Generation β’ 8B β’ Updated Dec 12, 2025 β’ 32.8k β’ 9
view article Article Building Tensors from Scratch in Rust (Part 1.2): View Operations Jun 18, 2025 β’ 4
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation β’ 8B β’ Updated May 29, 2025 β’ 50.5k β’ β’ 1.02k
Search-R1 Collection Preliminary checkpoints with outcome-only RL. β’ 15 items β’ Updated Aug 12, 2025 β’ 13
Skywork/Skywork-Reward-Llama-3.1-8B-v0.2 Text Classification β’ 8B β’ Updated Oct 25, 2024 β’ 21.7k β’ 39
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper β’ 2502.11089 β’ Published Feb 16, 2025 β’ 166
meta-llama/Llama-3.3-70B-Instruct Text Generation β’ 71B β’ Updated Dec 21, 2024 β’ 413k β’ β’ 2.63k