ytaewon's picture

ytaewon

hamzzi

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

How Far Are We from Believable AI Agents? A Framework for Evaluating the Believability of Human Behavior Simulation

upvoted a paper 8 days ago

Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States

commented on a paper 13 days ago

LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling

View all activity

Organizations

commented a paper 13 days ago

LIMOPro: Reasoning Refinement for Efficient and Effective Test-time Scaling

Paper • 2505.19187 • Published May 25 • 13 •

commented a paper 2 months ago

Learning from Peers in Reasoning Models

Paper • 2505.07787 • Published May 12 • 46 •

commented a paper 3 months ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7 • 66 •

commented 9 papers 4 months ago

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning

Paper • 2504.00891 • Published Apr 1 • 14 •

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3 • 57 •

A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond

Paper • 2503.21614 • Published Mar 27 • 42 •

Effectively Controlling Reasoning Models through Thinking Intervention

Paper • 2503.24370 • Published Mar 31 • 20 •

OThink-MR1: Stimulating multimodal generalized reasoning capabilities via dynamic reinforcement learning

Paper • 2503.16081 • Published Mar 20 • 28 •

Efficient Inference for Large Reasoning Models: A Survey

Paper • 2503.23077 • Published Mar 29 • 47 •

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published Mar 31 • 63 •

ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback

Paper • 2503.21332 • Published Mar 27 • 23 •

ReFeed: Multi-dimensional Summarization Refinement with Reflective Reasoning on Feedback

Paper • 2503.21332 • Published Mar 27 • 23 •