Kaiyan Zhang

iseesaw

https://iseesaw.github.io/

AI & ML interests

Large Reasoning Models, Reinforcement Learning, Agent

Recent Activity

authored a paper 10 days ago

FlowRL: Matching Reward Distributions for LLM Reasoning

upvoted a paper 10 days ago

FlowRL: Matching Reward Distributions for LLM Reasoning

upvoted a collection 10 days ago

Organizations

iseesaw's models

None public yet