Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
1
3
Yihong Wu
Yihong7788
Follow
ericray007's profile picture
lihengma's profile picture
2 followers
·
1 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
20 days ago
It Takes Two: Your GRPO Is Secretly DPO
upvoted
a
paper
29 days ago
On Predictability of Reinforcement Learning Dynamics for Large Language Models
commented
on
a paper
29 days ago
It Takes Two: Your GRPO Is Secretly DPO
View all activity
Organizations
None yet
Yihong7788
's datasets
1
Sort: Recently updated
Yihong7788/Hard_Question_2WIKI_Train
Updated
Apr 24
•
4