Hugging Face
Wei Shen
Swtheking
5 followers · 2 following
AI & ML interests
None yet
Recent Activity
upvoted a paper 9 days ago
DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation
upvoted a paper 3 months ago
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
commented on a paper 6 months ago
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
Organizations
Papers
6
arxiv:2505.11896
arxiv:2504.15843
arxiv:2504.14655
arxiv:2503.22230
models
0
None public yet
datasets
0
None public yet