T-LoRA: Single Image Diffusion Model Customization Without Overfitting Paper • 2507.05964 • Published 22 days ago • 113
Heeding the Inner Voice: Aligning ControlNet Training via Intermediate Features Feedback Paper • 2507.02321 • Published 27 days ago • 38
Revisiting the Minimalist Approach to Offline Reinforcement Learning Paper • 2305.09836 • Published May 16, 2023 • 3
Distilling LLMs' Decomposition Abilities into Compact Language Models Paper • 2402.01812 • Published Feb 2, 2024 • 1
Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size Paper • 2211.11092 • Published Nov 20, 2022 • 1
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning Paper • 2505.22914 • Published May 28 • 35
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement Learning Paper • 2505.22914 • Published May 28 • 35
GHOST 2.0: generative high-fidelity one shot transfer of heads Paper • 2502.18417 • Published Feb 25 • 67
LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers Paper • 2502.15007 • Published Feb 20 • 175
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? Paper • 2502.14502 • Published Feb 20 • 91
Unraveling the Complexity of Memory in RL Agents: an Approach for Classification and Evaluation Paper • 2412.06531 • Published Dec 9, 2024 • 73
XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning Paper • 2406.08973 • Published Jun 13, 2024 • 90
Linear Transformers with Learnable Kernel Functions are Better In-Context Models Paper • 2402.10644 • Published Feb 16, 2024 • 82
Revisiting the Minimalist Approach to Offline Reinforcement Learning Paper • 2305.09836 • Published May 16, 2023 • 3