Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning Paper • 2510.18849 • Published 28 days ago • 20
Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution Paper • 2509.25301 • Published Sep 29 • 17
A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning Paper • 2510.12838 • Published Oct 13 • 22
ACADREASON: Exploring the Limits of Reasoning Models with Academic Research Problems Paper • 2510.11652 • Published Oct 13 • 28
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published Aug 6 • 127
Efficient Agents: Building Effective Agents While Reducing Cost Paper • 2508.02694 • Published Jul 24 • 85
AFM-Models Collection The models and training dataset of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 12 items • Updated Aug 6 • 16