Yuchen Fan

yuchenFan

AI & ML interests

None yet

Recent Activity

updated a dataset 6 days ago

yuchenFan/Search-R1

authored a paper about 2 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

upvoted a paper 2 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

View all activity

Organizations

updated a dataset 6 days ago

yuchenFan/Search-R1

Updated 6 days ago • 9

authored a paper about 2 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 127

upvoted a paper 2 months ago

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

Paper • 2505.22617 • Published May 28 • 127

published a dataset 2 months ago

yuchenFan/Search-R1

Updated 6 days ago • 9

liked a Space 3 months ago

218

LLM训练终极指南 | The Ultra-Scale Playbook

🔥

了解LLM训练的方方面面

upvoted a paper 3 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 120

upvoted a paper 4 months ago

Unicorn: Text-Only Data Synthesis for Vision Language Model Training

Paper • 2503.22655 • Published Mar 28 • 40

authored a paper 5 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 29

upvoted a paper 5 months ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 29

updated a model 5 months ago

yuchenFan/EDIT-SFT

8B • Updated Feb 26 • 2

published a model 5 months ago

yuchenFan/EDIT-SFT

8B • Updated Feb 26 • 2

updated a model 5 months ago

yuchenFan/Difficulty-Classifier-qwen-7b-inst

8B • Updated Feb 23 • 2

published a model 5 months ago

yuchenFan/Difficulty-Classifier-qwen-7b-inst

8B • Updated Feb 23 • 2

authored a paper 6 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 62

upvoted 2 papers 6 months ago

Process Reinforcement through Implicit Rewards

Paper • 2502.01456 • Published Feb 3 • 62

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 22

updated a model 6 months ago

PRIME-RL/EurusPRM-Stage2

8B • Updated Feb 19 • 447 • 7

updated a model 7 months ago

PRIME-RL/EurusPRM-Stage1

8B • Updated Feb 19 • 484 • 4

updated a dataset 7 months ago

PRIME-RL/Eurus-2-Rollout

Viewer • Updated Jan 13 • 300k • 20 • 2

liked a model 7 months ago

PRIME-RL/EurusPRM-Stage2

8B • Updated Feb 19 • 447 • 7

Yuchen Fan

AI & ML interests

Recent Activity

Organizations

yuchenFan's activity

LLM训练终极指南 | The Ultra-Scale Playbook