Hugging Face
Yuanhao Wu
wuyhthu
AI & ML interests: None yet
Recent Activity
Authored a paper about 1 month ago: DuaShepherd: Integrating Stepwise Correctness and Potential Rewards for Mathematical Reasoning
Upvoted a paper about 1 month ago: DuaShepherd: Integrating Stepwise Correctness and Potential Rewards for Mathematical Reasoning
Commented on a paper about 1 month ago: DuaShepherd: Integrating Stepwise Correctness and Potential Rewards for Mathematical Reasoning
Organizations: None yet
Papers: 3
arxiv:2506.17533
arxiv:2501.13264
arxiv:2401.00396
Models: 0 (none public yet)
Datasets: 2
wuyhthu/math_shepherd_prm_mixed_reward_dpo • Updated Jan 27 • 860k • 8
wuyhthu/prm800k-phase2 • Updated Dec 30, 2024 • 485k • 7