Denis Tarasov's picture

10 1

Denis Tarasov

Adagrad

·

https://dt6a.github.io/

DT6A

AI & ML interests

RL, NLP

Recent Activity

upvoted a paper 19 days ago

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

upvoted a paper 26 days ago

Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

authored a paper about 2 months ago

Revisiting the Minimalist Approach to Offline Reinforcement Learning

View all activity

Organizations

upvoted a paper 19 days ago

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published 22 days ago • 113

upvoted a paper 26 days ago

Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback

Paper • 2507.02321 • Published 27 days ago • 38

authored 6 papers about 2 months ago

Revisiting the Minimalist Approach to Offline Reinforcement Learning

Paper • 2305.09836 • Published May 16, 2023 • 3

Distilling LLMs' Decomposition Abilities into Compact Language Models

Paper • 2402.01812 • Published Feb 2, 2024 • 1

Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

Paper • 2211.11092 • Published Nov 20, 2022 • 1

Anti-Exploration by Random Network Distillation

Paper • 2301.13616 • Published Jan 31, 2023

Vintix: Action Model via In-Context Reinforcement Learning

Paper • 2501.19400 • Published Jan 31

cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

Paper • 2505.22914 • Published May 28 • 35

upvoted a paper 2 months ago

cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning

Paper • 2505.22914 • Published May 28 • 35

liked a model 3 months ago

JetBrains/Mellum-4b-base

Text Generation • 4B • Updated May 7 • 22.9k • 410

upvoted 3 papers 5 months ago

GHOST 2.0: generative high-fidelity one shot transfer of heads

Paper • 2502.18417 • Published Feb 25 • 67

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 175

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20 • 91

upvoted a paper 8 months ago

Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation

Paper • 2412.06531 • Published Dec 9, 2024 • 73

upvoted a paper about 1 year ago

XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning

Paper • 2406.08973 • Published Jun 13, 2024 • 90

upvoted 2 papers over 1 year ago

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 82

Revisiting the Minimalist Approach to Offline Reinforcement Learning

Paper • 2305.09836 • Published May 16, 2023 • 3