10 19 2

Bo Liu

Benjamin-eecs

https://benjamin-eecs.github.io/

AI & ML interests

Reinforcement Learning, Reasoning, Machine Learning Systems

Recent Activity

authored a paper 4 days ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

upvoted a paper 4 days ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

authored a paper 21 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

View all activity

Organizations

authored a paper 4 days ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published 6 days ago • 12

upvoted a paper 4 days ago

SPICE: Self-Play In Corpus Environments Improves Reasoning

Paper • 2510.24684 • Published 6 days ago • 12

authored a paper 21 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published 25 days ago • 34

upvoted a paper 21 days ago

BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution

Paper • 2510.08697 • Published 25 days ago • 34

authored a paper 24 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published 25 days ago • 256

upvoted 2 papers 24 days ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1 • 57

Agent Learning via Early Experience

Paper • 2510.08558 • Published 25 days ago • 256

liked a Space about 1 month ago

BigCodeArena

🚀

Compare two AI models by sending them code and seeing their responses

authored a paper about 1 month ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1 • 87

upvoted a paper about 1 month ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1 • 87

commented a paper about 1 month ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published Jun 30 • 50 •

authored 2 papers about 1 month ago

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published Sep 29 • 18

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29 • 138

commented a paper about 1 month ago

Who invented deep residual learning?

Paper • 2509.24732 • Published Sep 29 • 4 •

upvoted 2 papers about 1 month ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

Paper • 2509.25541 • Published Sep 29 • 138

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published Sep 29 • 18

upvoted a paper about 2 months ago

Bootstrapping Task Spaces for Self-Improvement

Paper • 2509.04575 • Published Sep 4 • 5

authored a paper 2 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 83

upvoted a collection 2 months ago

LLaVA-Critic-R1

Collection

6 items • Updated Sep 3 • 2

upvoted a paper 2 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 83

Bo Liu

AI & ML interests

Recent Activity

Organizations

Benjamin-eecs's activity

BigCodeArena