arxiv:2509.01055
Hui Chen
chchenhui
AI & ML interests
Machine Learning, Natural language processing
Recent Activity
upvoted
a
paper
25 days ago
Language Models Can Learn from Verbal Feedback Without Scalar Rewards
authored
a paper
about 2 months ago
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
upvoted
a
paper
about 2 months ago
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use