Zhepei Wei's picture

2 7 2

Zhepei Wei

weizhepei

·

https://weizhepei.com

AI & ML interests

None yet

Recent Activity

updated a dataset 13 days ago

weizhepei/TruthRL-HotpotQA

updated a dataset 13 days ago

weizhepei/TruthRL-NaturalQuestions

updated a dataset 14 days ago

weizhepei/TruthRL-MuSiQue

View all activity

Organizations

upvoted a paper 26 days ago

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

Paper • 2510.06217 • Published 27 days ago • 62

upvoted 2 papers about 1 month ago

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3 • 21

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 54

upvoted a paper 2 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27 • 84

upvoted a paper 3 months ago

R-Zero: Self-Evolving Reasoning LLM from Zero Data

Paper • 2508.05004 • Published Aug 7 • 126

upvoted 2 papers 5 months ago

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

Paper • 2506.01347 • Published Jun 2 • 3

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Paper • 2505.16421 • Published May 22 • 19