Jian Hu
chuyi777
AI & ML interests
Reinforcement Learning
Recent Activity
upvoted
a
paper
about 1 month ago
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per
Token via Reinforcement Learning
upvoted
a
paper
2 months ago
BroRL: Scaling Reinforcement Learning via Broadened Exploration
liked
a model
3 months ago
moonshotai/Kimi-K2-Instruct-0905