QuantaAlpha: An Evolutionary Framework for LLM-Driven Alpha Mining Paper • 2602.07085 • Published 4 days ago • 94
F-GRPO: Don't Let Your Policy Learn the Obvious and Forget the Rare Paper • 2602.06717 • Published 4 days ago • 65
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning Paper • 2602.04634 • Published 6 days ago • 89
Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations Paper • 2602.05885 • Published 5 days ago • 26
Reinforcement World Model Learning for LLM-based Agents Paper • 2602.05842 • Published 5 days ago • 21
No One-Size-Fits-All: Building Systems For Translation to Bashkir, Kazakh, Kyrgyz, Tatar and Chuvash Using Synthetic And Original Data Paper • 2602.04442 • Published 6 days ago • 3
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published 10 days ago • 266
SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published 8 days ago • 59
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents Paper • 2601.16973 • Published 18 days ago • 40