Yuanhao Wu's picture

2 2 5

Yuanhao Wu

wuyhthu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Why Language Models Hallucinate

authored a paper 4 months ago

DuaShepherd: Integrating Stepwise Correctness and Potential Rewards for Mathematical Reasoning

upvoted a paper 4 months ago

DuaShepherd: Integrating Stepwise Correctness and Potential Rewards for Mathematical Reasoning

View all activity

Organizations

None yet

Papers 3

arxiv:2506.17533

arxiv:2501.13264

arxiv:2401.00396

models 0

None public yet

datasets 2

wuyhthu/math_shepherd_prm_mixed_reward_dpo

Viewer • Updated Jan 27 • 860k • 19

wuyhthu/prm800k-phase2

Viewer • Updated Dec 30, 2024 • 485k • 15