Submitted by taesiri 91 ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data · 21 authors 105 4
Submitted by daixuancheng 80 FlowRL: Matching Reward Distributions for LLM Reasoning · 23 authors 41 2
Submitted by yaful 47 Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation · 7 authors 14 3
Submitted by wyu1 29 Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation · 10 authors 19 2
Submitted by YueXY233 25 Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation · 8 authors 2
Submitted by zhangysk 23 FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning · 23 authors 2
Submitted by taesiri 17 RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation · 13 authors 179 2
Submitted by taesiri 16 WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance · 5 authors 2
Submitted by taesiri 7 MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks · 10 authors 2
Submitted by feiliu1 5 RecoWorld: Building Simulated Environments for Agentic Recommender Systems · 15 authors 2
Submitted by onlyairnopods 5 Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization · 8 authors 2
Submitted by LeoLau 4 Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding · 4 authors 6 2
Submitted by xzyao 4 Apertus: Democratizing Open and Compliant LLMs for Global Language Environments · 101 authors 107 2
Submitted by C-Tianyu 3 EdiVal-Agent: An Object-Centric Framework for Automated, Scalable, Fine-Grained Evaluation of Multi-Turn Editing · 16 authors 2
Submitted by hao-li 3 Agentic Software Engineering: Foundational Pillars and a Research Roadmap · 7 authors 2
Submitted by mario-sanz 2 Mind the Gap: A Closer Look at Tokenization for Multiple-Choice Question Answering with LLMs · 3 authors 1
Submitted by chaoyinshe 1 EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence · 5 authors 3 2
Submitted by Suzhen - Developer-LLM Conversations: An Empirical Study of Interactions and Generated Code Quality · 3 authors 2
Submitted by zx-Xie - FSG-Net: Frequency-Spatial Synergistic Gated Network for High-Resolution Remote Sensing Change Detection · 8 authors 1 2