Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning Paper • 2510.05251 • Published 11 days ago • 7
Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training Paper • 2509.21500 • Published 22 days ago • 17
Grounded Persuasive Language Generation for Automated Marketing Paper • 2502.16810 • Published Feb 24 • 12
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published Oct 31, 2024 • 63
Beyond Reverse KL: Generalizing Direct Preference Optimization with Diverse Divergence Constraints Paper • 2309.16240 • Published Sep 28, 2023
ReCode: Robustness Evaluation of Code Generation Models Paper • 2212.10264 • Published Dec 20, 2022 • 1
Equipping Transformer with Random-Access Reading for Long-Context Understanding Paper • 2405.13216 • Published May 21, 2024 • 1
Efficient Shapley Values Estimation by Amortization for Text Classification Paper • 2305.19998 • Published May 31, 2023
Word-level Textual Adversarial Attacking as Combinatorial Optimization Paper • 1910.12196 • Published Oct 27, 2019
Narrative Question Answering with Cutting-Edge Open-Domain QA Techniques: A Comprehensive Study Paper • 2106.03826 • Published Jun 7, 2021