2 5

Jiahao Qiu

jiahaoq

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

commented on a paper 4 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

authored a paper 5 months ago

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

View all activity

Organizations

upvoted a paper 4 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 81

commented a paper 4 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 81 •

authored a paper 5 months ago

ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs

Paper • 2506.18896 • Published Jun 23 • 29

commented a paper 6 months ago

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Paper • 2505.20286 • Published May 26 • 8 •

upvoted a paper 6 months ago

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Paper • 2505.20286 • Published May 26 • 8

authored 9 papers 6 months ago

MaxMin-RLHF: Towards Equitable Alignment of Large Language Models with Diverse Human Preferences

Paper • 2402.08925 • Published Feb 14, 2024 • 1

TreeBoN: Enhancing Inference-Time Alignment with Speculative Tree-Search and Best-of-N Sampling

Paper • 2410.16033 • Published Oct 18, 2024

Fast Best-of-N Decoding via Speculative Rejection

Paper • 2410.20290 • Published Oct 26, 2024 • 10

Temporal Consistency for LLM Reasoning Process Error Identification

Paper • 2503.14495 • Published Mar 18 • 11

Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models

Paper • 2503.24377 • Published Mar 31 • 18

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety

Paper • 2504.09689 • Published Apr 13 • 6

OTC: Optimal Tool Calls via Reinforcement Learning

Paper • 2504.14870 • Published Apr 21 • 35

On Path to Multimodal Historical Reasoning: HistBench and HistAgent

Paper • 2505.20246 • Published May 26

Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evolution

Paper • 2505.20286 • Published May 26 • 8

updated a dataset 6 months ago

jiahaoq/HistBench

Updated May 27 • 137 • 1

published a dataset 6 months ago

jiahaoq/HistBench

Updated May 27 • 137 • 1

upvoted 2 papers 7 months ago

ToolRL: Reward is All Tool Learning Needs

Paper • 2504.13958 • Published Apr 16 • 48

OTC: Optimal Tool Calls via Reinforcement Learning

Paper • 2504.14870 • Published Apr 21 • 35

upvoted a paper 8 months ago

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety

Paper • 2504.09689 • Published Apr 13 • 6

Jiahao Qiu

AI & ML interests

Recent Activity

Organizations

jiahaoq's activity