
Bingxiang He

hbx

https://hbx-hbx.github.io/

AI & ML interests

NLP

Recent Activity

liked a model about 9 hours ago

openbmb/AgentCPM-Explore

updated a model 17 days ago

hbx/JustRL-Nemotron-1.5B

updated a model 17 days ago

hbx/JustRL-DeepSeek-1.5B



liked a model about 9 hours ago

openbmb/AgentCPM-Explore

Text Generation • 4B • Updated 1 day ago • 315 • 275

updated 2 models 17 days ago

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated 17 days ago • 724 • 2

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated 17 days ago • 1.04k • 8

upvoted a collection 18 days ago

JustRL

Collection

2 items • Updated Nov 1, 2025 • 5

New activity in hbx/JustRL-Nemotron-1.5B 26 days ago

Add Hugging Face paper link badge to model card

#1 opened 26 days ago by

nielsr

New activity in hbx/JustRL-DeepSeek-1.5B 26 days ago

Improve model card: Update title, add paper link, correct license and citation

#1 opened 26 days ago by

nielsr

commented a paper 27 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 28 days ago • 24

upvoted a paper 27 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 28 days ago • 24

submitted a paper to Daily Papers 27 days ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published 28 days ago • 24

upvoted a paper about 2 months ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published Nov 17, 2025 • 134

liked 2 models 2 months ago

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated 17 days ago • 1.04k • 8

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated 17 days ago • 724 • 2

upvoted a paper 2 months ago

CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents

Paper • 2511.02734 • Published Nov 4, 2025 • 21

updated 2 models 2 months ago

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated 17 days ago • 1.04k • 8

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated 17 days ago • 724 • 2

updated a collection 3 months ago

JustRL

Collection

2 items • Updated Nov 1, 2025 • 5

published a model 3 months ago

hbx/JustRL-Nemotron-1.5B

Text Generation • 2B • Updated 17 days ago • 724 • 2

updated a collection 3 months ago

JustRL

Collection

2 items • Updated Nov 1, 2025 • 5

updated a model 3 months ago

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated 17 days ago • 1.04k • 8

published a model 3 months ago

hbx/JustRL-DeepSeek-1.5B

Text Generation • 2B • Updated 17 days ago • 1.04k • 8
