thinking-mechanisms
Collection of research papers on the thinking mechanisms of LLMs and AI agents
Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published Sep 9 • 98
ParaThinker: Native Parallel Thinking as a New Paradigm to Scale LLM Test-time Compute Paper • 2509.04475 • Published Aug 30 • 3
Don't Overthink it. Preferring Shorter Thinking Chains for Improved LLM Reasoning Paper • 2505.17813 • Published May 23 • 57
Deep Think with Confidence Paper • 2508.15260 • Published Aug 21 • 87
long-horizon-agent
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents Paper • 2509.06501 • Published Sep 8 • 78