Training Language Models to Self-Correct via Reinforcement Learning • Paper • 2409.12917 • Published Sep 19, 2024 • 141
Open-Endedness is Essential for Artificial Superhuman Intelligence • Paper • 2406.04268 • Published Jun 6, 2024 • 13
Human-Timescale Adaptation in an Open-Ended Task Space • Paper • 2301.07608 • Published Jan 18, 2023 • 1