Perception-Aware Policy Optimization for Multimodal Reasoning Paper • 2507.06448 • Published 19 days ago • 43
High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning Paper • 2507.05920 • Published 20 days ago • 11