Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
5
16
19
Ganqu Cui
ganqu
Follow
Spico's profile picture
SteveSHEN's profile picture
WangSl2004's profile picture
20 followers
·
2 following
cgq15
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
RLPR: Extrapolating RLVR to General Domains without Verifiers
upvoted
a
collection
about 2 months ago
MiniCPM4
upvoted
a
paper
about 2 months ago
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning
View all activity
Organizations
Articles
1
Article
29
Process Reinforcement through Implicit Rewards
Papers
16
arxiv:
2505.22617
arxiv:
2504.16084
arxiv:
2504.14945
arxiv:
2503.21614
Expand 16 papers
models
0
None public yet
datasets
1
ganqu/openbackdoor
Preview
•
Updated
Oct 23, 2024
•
26