2 8 3

Zhepei Wei

weizhepei

https://weizhepei.com

AI & ML interests

None yet

Recent Activity

liked a model 25 days ago

open-thoughts/OpenThinker-Agent-v1

published a dataset about 1 month ago

weizhepei/WebArena-Lite-SFT

upvoted a paper about 2 months ago

VisPlay: Self-Evolving Vision-Language Models from Images

View all activity

Organizations

liked a model 25 days ago

open-thoughts/OpenThinker-Agent-v1

Text Generation • 8B • Updated 29 days ago • 1.63k • 88

published a dataset about 1 month ago

weizhepei/WebArena-Lite-SFT

Viewer • Updated Mar 22, 2025 • 18.9k • 45 • 1

upvoted a paper about 2 months ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 42

updated 2 datasets 2 months ago

weizhepei/TruthRL-HotpotQA

Viewer • Updated Oct 21, 2025 • 7.41k • 13

weizhepei/TruthRL-NaturalQuestions

Viewer • Updated Oct 21, 2025 • 3.61k • 8

updated a dataset 3 months ago

weizhepei/TruthRL-MuSiQue

Viewer • Updated Oct 20, 2025 • 22.4k • 26

published 3 datasets 3 months ago

updated a dataset 3 months ago

weizhepei/TruthRL-CRAG

Viewer • Updated Oct 20, 2025 • 1.3k • 37

published a dataset 3 months ago

weizhepei/TruthRL-CRAG

Viewer • Updated Oct 20, 2025 • 1.3k • 37

upvoted a paper 3 months ago

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

Paper • 2510.06217 • Published Oct 7, 2025 • 63

updated a dataset 3 months ago

meng-lab/DeSA-RecallResult

Updated Oct 2, 2025 • 23

upvoted 2 papers 3 months ago

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3, 2025 • 22

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

commented a paper 3 months ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55 •

updated a model 3 months ago

meng-lab/DeSA-qwen2.5-3b-it-em-after50stepsacc-step150

3B • Updated Sep 28, 2025 • 6

published a model 3 months ago

meng-lab/DeSA-qwen2.5-3b-it-em-after50stepsacc-step150

3B • Updated Sep 28, 2025 • 6

updated a model 3 months ago

meng-lab/DeSA-qwen2.5-3b-it-stage1-acc

Updated Sep 24, 2025

published a model 3 months ago

meng-lab/DeSA-qwen2.5-3b-it-stage1-acc

Updated Sep 24, 2025

Zhepei Wei

AI & ML interests

Recent Activity

Organizations

weizhepei's activity