Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense Paper • 2510.07242 • Published 26 days ago • 30
Debate or Vote: Which Yields Better Decisions in Multi-Agent Large Language Models? Paper • 2508.17536 • Published Aug 24 • 1