Reinforcement Learning for Reasoning in Large Language Models with One Training Example Paper • 2504.20571 • Published Apr 29 • 98
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Paper • 2406.08407 • Published Jun 12, 2024 • 28
Discriminative Diffusion Models as Few-shot Vision and Language Learners Paper • 2305.10722 • Published May 18, 2023 • 3