Revisiting the Minimalist Approach to Offline Reinforcement Learning Paper • 2305.09836 • Published May 16, 2023 • 3
Distilling LLMs' Decomposition Abilities into Compact Language Models Paper • 2402.01812 • Published Feb 2, 2024 • 1
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size Paper • 2211.11092 • Published Nov 20, 2022 • 1
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning Paper • 2505.22914 • Published May 28, 2025 • 36