Jun Bai's picture

5 5

Jun Bai

MSJun

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

upvoted a paper 2 months ago

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

upvoted a paper 3 months ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

View all activity

Organizations

None yet

models 0

None public yet

datasets 0

None public yet