DNA Bench: When Silence is Smarter -- Benchmarking Over-Reasoning in Reasoning LLMs Paper • 2503.15793 • Published Mar 20
SYNTHETIC-1 Collection A collection of tasks & verifiers for reasoning datasets • 9 items • Updated Oct 7 • 64
The Ultra-Scale Playbook 🌌 • 3.52k • The ultimate guide to training LLMs on large GPU clusters