Hugging Face
Ru Peng
RuPeng
5 followers · 8 following
AI & ML interests
None yet
Recent Activity
Upvoted a paper 3 days ago: Group Sequence Policy Optimization
Upvoted a paper 8 days ago: Agentic Reinforced Policy Optimization
Upvoted a paper 24 days ago: Reinforcement Learning with Rubric Anchors
Organizations
None yet
RuPeng's models (4)
RuPeng/DataMan-MoE-A2.7B-ZH • 14B • Updated Aug 9 • 8
RuPeng/DataMan-MoE-A2.7B-EN • 14B • Updated Aug 8 • 8
RuPeng/DataMan-1.5B-ZH • 2B • Updated Aug 8 • 20
RuPeng/DataMan-1.5B-EN • 2B • Updated Aug 7 • 173