RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search Paper โข 2504.15047 โข Published Apr 21 โข 6
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper โข 2503.16219 โข Published Mar 20 โข 51