Denis Tarasov's picture

10 1

Denis Tarasov

Adagrad

·

https://dt6a.github.io/

DT6A

AI & ML interests

RL, NLP

Organizations

authored 6 papers 5 months ago

Revisiting the Minimalist Approach to Offline Reinforcement Learning

Paper • 2305.09836 • Published May 16, 2023 • 3

Distilling LLMs' Decomposition Abilities into Compact Language Models

Paper • 2402.01812 • Published Feb 2, 2024 • 1

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

Paper • 2211.11092 • Published Nov 20, 2022 • 1

Anti-Exploration by Random Network Distillation

Paper • 2301.13616 • Published Jan 31, 2023

Vintix: Action Model via In-Context Reinforcement Learning

Paper • 2501.19400 • Published Jan 31

cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

Paper • 2505.22914 • Published May 28 • 36