intfloat/multilingual-e5-large-instruct Feature Extraction • 0.6B • Updated 21 days ago • 2.16M • • 539
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models Paper • 2505.00551 • Published May 1 • 37
view article Article FastRTC: The Real-Time Communication Library for Python By freddyaboulton and 1 other • Feb 25 • 171
Running 1.01k 1.01k FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
Running 2.85k 2.85k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters