Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
20
26
31
Rui Yang
Ray2333
Follow
Trangle's profile picture
research4pan's profile picture
testamentaddress01's profile picture
14 followers
·
8 following
https://yangrui2015.github.io
YangRui2015
AI & ML interests
Deep Reinforcement Learning
Recent Activity
upvoted
a
paper
25 days ago
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
upvoted
a
paper
2 months ago
Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities
upvoted
a
paper
3 months ago
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
View all activity
Organizations
Ray2333
's datasets
1
Sort: Recently updated
Ray2333/RiC_harmless_helpful
Viewer
•
Updated
Jul 12, 2024
•
291k
•
118