Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
ChengpengLi's picture
3 13 2

ChengpengLi

ChengpengLi
RichardQRQ's profile picture akhaliq's profile picture AndroidGuy's profile picture
·

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper 16 days ago
Agentic Entropy-Balanced Policy Optimization
upvoted a paper about 1 month ago
Quantile Advantage Estimation for Entropy-Safe Reasoning
upvoted a paper 3 months ago
We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning
View all activity

Organizations

None yet

authored a paper 8 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113
authored 3 papers over 1 year ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 166

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 21

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 17
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs