microsoft/Phi-4-mini-flash-reasoning Text Generation • 4B • Updated 11 days ago • 9.76k • 205
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated May 1 • 306k • 1.46k
Running • 2.85k • The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24 • 973k • 1.28k
meta-llama/Llama-3.2-3B-Instruct Text Generation • 3B • Updated Oct 24, 2024 • 1.94M • 1.63k