long
kevinlong
AI & ML interests
None yet
Recent Activity
upvoted a paper
about 18 hours ago
A Survey of Reinforcement Learning for Large Reasoning Models
commented on a paper
about 2 months ago
Group Sequence Policy Optimization
upvoted a paper
about 2 months ago
RiemannLoRA: A Unified Riemannian Framework for Ambiguity-Free LoRA Optimization
Organizations
None yet