view article Article Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face By abidlabs and 4 others • 1 day ago • 55
view article Article Sensitivity Aware Mixed Precision Quantization V1 By badaoui and 1 other • Jun 13 • 18
view article Article How Long Prompts Block Other Requests - Optimizing LLM Performance By tngtech • Jun 12 • 5
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Paper • 2506.18349 • Published Jun 23 • 13
SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Paper • 2506.18349 • Published Jun 23 • 13 • 2