arxiv:2509.09674
Yuchen Zhang
YucZhang2003
AI & ML interests
LLM, RL
Recent Activity
liked
a model
13 days ago
PaddlePaddle/PaddleOCR-VL
upvoted
a
paper
about 1 month ago
From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by
Composing Old Ones
authored
a paper
about 2 months ago
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning