ChengpengLi's picture

3 13 2

ChengpengLi

ChengpengLi

·

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper 16 days ago

Agentic Entropy-Balanced Policy Optimization

upvoted a paper about 1 month ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

upvoted a paper 3 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

View all activity

Organizations

None yet

authored a paper 8 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

authored 3 papers over 1 year ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 166

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 21

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 17