The collection for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
Xinyu Zhu
TianHongZXY
AI & ML interests
Large Language Models; Reasoning; Reinforcement Learning
Recent Activity
upvoted a collection 3 days ago: RLVR-Decomposed
updated a model 3 days ago: TianHongZXY/Qwen2.5-Math-7B-GRPO
updated a dataset 9 days ago: TianHongZXY/synthesize_problems