Kaiyan Zhang
iseesaw
AI & ML interests
Large Reasoning Models, Reinforcement Learning, Agent
Recent Activity
authored a paper 10 days ago
FlowRL: Matching Reward Distributions for LLM Reasoning
upvoted a paper 10 days ago
FlowRL: Matching Reward Distributions for LLM Reasoning
upvoted a collection 10 days ago
DeepSeek-V3.2