Jianfeng Gao
wyngjf
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
FlowRL: Matching Reward Distributions for LLM Reasoning
authored
a paper
6 months ago
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language
Models in Math
authored
a paper
6 months ago
Reinforcement Learning for Reasoning in Large Language Models with One
Training Example
Organizations
None yet