Huining Yuan

HuiningYuan

AI & ML interests

Reinforcement learning, LLM Agents, World models

Recent Activity

updated a model about 15 hours ago

nics-efc/MARSHAL-Mini-Hanabi-Qwen3-4B

updated a model about 15 hours ago

nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B

updated a model about 15 hours ago

nics-efc/MARSHAL-Tic-Tac-Toe-Qwen3-4B

upvoted a paper 1 day ago

RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI

Paper • 2602.07837 • Published 4 days ago • 47

upvoted a paper 3 days ago

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Paper • 2509.15965 • Published Sep 19, 2025 • 17

upvoted a paper 7 days ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 8 days ago • 91

upvoted a collection 2 months ago

MARSHAL

MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs 🎉 Accepted by ICLR 2026 • 6 items • Updated about 15 hours ago • 2

upvoted a paper 2 months ago

DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle

Paper • 2512.04324 • Published Dec 3, 2025 • 154

upvoted 2 papers 8 months ago

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19, 2025 • 60

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Paper • 2506.02387 • Published Jun 3, 2025 • 58