Submitted by taesiri 85 ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data · 21 authors 97 4
Submitted by daixuancheng 75 FlowRL: Matching Reward Distributions for LLM Reasoning · 23 authors 32 2
Submitted by yaful 45 Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Delibration · 7 authors 13 3
Submitted by wyu1 29 Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation · 10 authors 15 2
Submitted by YueXY233 23 Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation · 8 authors 2
Submitted by zhangysk 23 FinSearchComp: Towards a Realistic, Expert-Level Evaluation of Financial Search and Reasoning · 23 authors 2
Submitted by taesiri 13 RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation · 13 authors 176 2
Submitted by taesiri 10 WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance · 5 authors 2
Submitted by taesiri 5 MultiEdit: Advancing Instruction-based Image Editing on Diverse and Challenging Tasks · 10 authors 2
Submitted by onlyairnopods 5 Can Multimodal LLMs See Materials Clearly? A Multimodal Benchmark on Materials Characterization · 8 authors 2
Submitted by LeoLau 4 Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding · 4 authors 5 2
Submitted by xzyao 4 Apertus: Democratizing Open and Compliant LLMs for Global Language Environments · 101 authors 107 2
Submitted by feiliu1 4 RecoWorld: Building Simulated Environments for Agentic Recommender Systems · 15 authors 2
Submitted by C-Tianyu 3 EdiVal-Agent: An Object-Centric Framework for Automated, Scalable, Fine-Grained Evaluation of Multi-Turn Editing · 16 authors 2
Submitted by hao-li 3 Agentic Software Engineering: Foundational Pillars and a Research Roadmap · 7 authors 2
Submitted by mario-sanz 2 Mind the Gap: A Closer Look at Tokenization for Multiple-Choice Question Answering with LLMs · 3 authors 1
Submitted by chaoyinshe 1 EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence · 5 authors 3 2
Submitted by Suzhen - Developer-LLM Conversations: An Empirical Study of Interactions and Generated Code Quality · 3 authors 2
Submitted by zx-Xie - FSG-Net: Frequency-Spatial Synergistic Gated Network for High-Resolution Remote Sensing Change Detection · 8 authors 1 2