Hugging Face
ChengpengLi
4 followers · 8 following
AI & ML interests
LLMs for reasoning, reinforcement learning, recommendation systems, diffusion models
Recent Activity
Upvoted a paper 4 days ago: Agentic Entropy-Balanced Policy Optimization
Upvoted a paper 23 days ago: Quantile Advantage Estimation for Entropy-Safe Reasoning
Upvoted a paper 2 months ago: We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning
Organizations
None yet
Papers (4)
arxiv:2503.04625
arxiv:2407.10671
arxiv:2407.04078
arxiv:2406.13542
Models (1)
ChengpengLi/START (updated Feb 21)
Datasets (0)
None public yet