Advantage-Guided Distillation for Preference Alignment in Small Language Models Paper • 2502.17927 • Published Feb 25 • 1