Jun Bai
MSJun
AI & ML interests
None yet
Recent Activity
upvoted a paper about 2 months ago
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
upvoted a paper 2 months ago
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space
upvoted a paper 3 months ago
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Organizations
None yet