3 9 4

charliezhang

Clockz

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 hour ago

Interplay-LM-Reasoning/composition

updated a dataset about 16 hours ago

Interplay-LM-Reasoning/context

upvoted a paper 21 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

View all activity

Organizations

updated a dataset about 1 hour ago

Interplay-LM-Reasoning/composition

Viewer • Updated about 1 hour ago • 3.8k • 2

updated a dataset about 16 hours ago

Interplay-LM-Reasoning/context

Updated about 16 hours ago • 2

upvoted a paper 21 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 23 days ago • 61

upvoted a paper 27 days ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 98

liked a model about 1 month ago

allenai/Olmo-3.1-7B-RL-Zero-Math

Text Generation • 528k • Updated 9 days ago • 1.6k • 10

New activity in Interplay-LM-Reasoning/extrapolation_midtrain about 1 month ago

Add pipeline tag, GitHub link, and improved model description

#1 opened about 1 month ago by

nielsr

New activity in Interplay-LM-Reasoning/extrapolation_rl about 1 month ago

Improve model card: Add pipeline tag and GitHub link

#1 opened about 1 month ago by

nielsr

updated 2 models about 1 month ago

Interplay-LM-Reasoning/extrapolation_rl

Text Generation • Updated Dec 14, 2025

Interplay-LM-Reasoning/extrapolation_midtrain

Text Generation • Updated Dec 14, 2025

published 2 datasets about 1 month ago

Interplay-LM-Reasoning/context

Updated about 16 hours ago • 2

Interplay-LM-Reasoning/composition

Viewer • Updated about 1 hour ago • 3.8k • 2

published 3 models about 1 month ago

authored a paper about 1 month ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 36

upvoted 2 papers about 1 month ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published Dec 8, 2025 • 36

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 152

updated a model about 2 months ago

goodevening/composition-10B-op-cpt-rl_fixed

Updated Nov 21, 2025

published a model about 2 months ago

goodevening/composition-10B-op-cpt-rl_fixed

Updated Nov 21, 2025

upvoted a paper 3 months ago

The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution

Paper • 2510.25726 • Published Oct 29, 2025 • 45

charliezhang

AI & ML interests

Recent Activity

Organizations

Clockz's activity

Add pipeline tag, GitHub link, and improved model description

Improve model card: Add pipeline tag and GitHub link