Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models Paper • 2411.08733 • Published Nov 13, 2024 • 1