
Yinxu Pan

cppowboy

https://github.com/Cppowboy

AI & ML interests

RL for LLM, Code & Math Reasoning, Function Calling, Code Interpreter, Vision-Language Pretraining

Recent Activity


liked a model about 2 hours ago

openbmb/VoxCPM-0.5B

Text-to-Speech • Updated 2 days ago • 393 • 271

upvoted 2 papers 7 days ago

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published 8 days ago • 153

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 176

upvoted a paper 9 days ago

WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents

Paper • 2509.06501 • Published 10 days ago • 77

liked 2 datasets 9 days ago

Pageshift-Entertainment/LongPage

Viewer • Updated 13 days ago • 300 • 9.97k • 49

jupyter-agent/jupyter-agent-dataset

Viewer • Updated 8 days ago • 95.8k • 5.19k • 138

New activity in hkust-nlp/WebExplorer-QA 9 days ago

Will the full train dataset be open sourced in the future?

#2 opened 9 days ago by cppowboy

liked a dataset 9 days ago

hkust-nlp/WebExplorer-QA

Viewer • Updated 9 days ago • 100 • 172 • 4

upvoted a paper 10 days ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published 14 days ago • 169

liked a model 12 days ago

openbmb/MiniCPM4.1-8B

Text Generation • 8B • Updated 2 days ago • 3.42k • 307

liked 2 datasets 17 days ago

MathArena/hmmt_feb_2025

Viewer • Updated May 14 • 30 • 1.33k • 4

nvidia/OpenScienceReasoning-2

Viewer • Updated Jul 31 • 803k • 2.03k • 40

upvoted a paper 20 days ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published 21 days ago • 106

upvoted a paper 22 days ago

Hermes 4 Technical Report

Paper • 2508.18255 • Published 24 days ago • 35

New activity in r2e-edits/SweSmith-RL-Dataset 23 days ago

Are these docker images publicly available?

#2 opened 23 days ago by cppowboy

liked a model 23 days ago

openbmb/MiniCPM-V-4_5

Image-Text-to-Text • 9B • Updated 3 days ago • 64.4k • 955

New activity in SWE-bench/SWE-smith 24 days ago

Hello, why are the FAIL_TO_PASS files missing from the image?

#6 opened about 1 month ago by ray075hl

New activity in nebius/SWE-rebench 24 days ago

Could this dataset be repurposed for LLM training?

#7 opened 24 days ago by cppowboy

liked a dataset 26 days ago

Alibaba-NLP/WebShaper

Viewer • Updated Jul 22 • 500 • 6.3k • 21

liked a dataset 28 days ago

inclusionAI/ASearcher-train-data

Preview • Updated Aug 13 • 546 • 12
