Li
flounder123
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 3 hours ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
upvoted
a
paper
about 2 months ago
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains
RLVR
upvoted
a
paper
5 months ago
STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs
Organizations
None yet