Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs Paper • 2506.14731 • Published Jun 17 • 8
Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model Paper • 2510.18855 • Published 13 days ago • 62
Ring Collection Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI, derived from Ling. • 5 items • Updated 21 days ago • 19