weiyao_ruc
weiweiruc
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
upvoted
a
paper
about 1 month ago
Information Gain-based Policy Optimization: A Simple and Effective
Approach for Multi-Turn LLM Agents
upvoted
a
paper
3 months ago
ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability
Organizations
None yet