Transition Models: Rethinking the Generative Learning Objective Paper • 2509.04394 • Published Sep 4 • 28
view article Article Enabling Long Context Training with Sequence Parallelism in Axolotl By axolotl-ai-co and 1 other • Apr 4 • 14