Xinyu Zhu's picture

Xinyu Zhu

TianHongZXY

·

https://zhuxinyu.top

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

updated a model about 11 hours ago

meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval

published a model about 11 hours ago

meng-lab/MATH-Qwen3-8B-Base-GRPO-Serval

liked a dataset 10 days ago

Xnhyacinth/LongBench

View all activity

Organizations

Collections 2

Papers 13

arxiv:2603.00889

arxiv:2506.01347

arxiv:2506.15710

arxiv:2409.18786

models 12

TianHongZXY/CHIMERA-4B-SFT

4B • Updated Mar 2 • 12 • 2

TianHongZXY/CHIMERA-4B-RL

4B • Updated Mar 2 • 13 • 4

TianHongZXY/Qwen3-4B-NSR

4B • Updated Dec 6, 2025 • 3

TianHongZXY/Qwen2.5-Math-7B-GRPO

8B • Updated Jul 28, 2025 • 2

TianHongZXY/OpenR1-Math-46k-8192-Qwen2.5-7B-Instruct-GRPO-clip_0.28

Updated Jul 8, 2025

TianHongZXY/Qwen2.5-Math-7B-W-REINFORCE

8B • Updated Jun 1, 2025 • 5 • 1

TianHongZXY/Qwen3-4B-GRPO

4B • Updated May 31, 2025 • 4

TianHongZXY/Qwen3-4B-PPO

4B • Updated May 31, 2025 • 2

TianHongZXY/Qwen3-4B-PSR

4B • Updated May 31, 2025 • 11 • 1

TianHongZXY/Qwen2.5-Math-7B-PPO

8B • Updated May 31, 2025 • 3

datasets 6

TianHongZXY/CHIMERA

Viewer • Updated 11 days ago • 9.23k • 599 • 21

TianHongZXY/aime-1983-2025

Viewer • Updated Apr 16, 2025 • 963 • 169

TianHongZXY/AIME2025

Viewer • Updated Mar 22, 2025 • 30 • 339 • 1

TianHongZXY/AIME2024

Viewer • Updated Mar 22, 2025 • 30 • 119

TianHongZXY/amc23

Viewer • Updated Mar 22, 2025 • 40 • 323

TianHongZXY/MATH

Viewer • Updated Jan 12, 2025 • 12.5k • 1.01k • 3