Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers Paper • 2504.20752 • Published Apr 29, 2025 • 92
AtlaAI/Selene-1-Mini-Llama-3.1-8B Text Generation • 8B • Updated Jul 25, 2025 • 427 • 102
RULER: What's the Real Context Size of Your Long-Context Language Models? Paper • 2404.06654 • Published Apr 9, 2024 • 39