Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Denis Tarasov's picture
10 1

Denis Tarasov

Adagrad
vkurenkov's profile picture
·
https://dt6a.github.io/
  • DT6A

AI & ML interests

RL, NLP

Organizations

dunnolab's profile picture

authored 6 papers 5 months ago

Revisiting the Minimalist Approach to Offline Reinforcement Learning

Paper • 2305.09836 • Published May 16, 2023 • 3

Distilling LLMs' Decomposition Abilities into Compact Language Models

Paper • 2402.01812 • Published Feb 2, 2024 • 1

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

Paper • 2211.11092 • Published Nov 20, 2022 • 1

Anti-Exploration by Random Network Distillation

Paper • 2301.13616 • Published Jan 31, 2023

Vintix: Action Model via In-Context Reinforcement Learning

Paper • 2501.19400 • Published Jan 31

cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

Paper • 2505.22914 • Published May 28 • 36
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs