Submitted by AnthonyPeng 34 Large Reasoning Models Learn Better Alignment from Flawed Thinking MetaSuperintelligenceLab 3
Submitted by zichenwen 30 Efficient Multi-modal Large Language Models via Progressive Consistency Distillation Shanghai Jiao Tong University 9 1
Submitted by SAGE2000 17 Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition The University of Hong Kong 9 3
Submitted by SpiridonSunRotator 16 Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization IST Austria Distributed Algorithms and Systems Lab 2
Submitted by ShijianDeng 11 Self-Improvement in Multimodal Large Language Models: A Survey · 5 authors 2
Submitted by jasonrqh 9 Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents · 11 authors 10 2
Submitted by WeiChihChen 7 Game-Time: Evaluating Temporal Dynamics in Spoken Language Models · 10 authors 2
Submitted by sci-m-wang 6 REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration ContiAI 2
Submitted by thomwolf 4 OpenTSLM: Time-Series Language Models for Reasoning over Multivariate Medical Text- and Time-Series Data OpenTSLM - Open Source Time Series Language Models 2
Submitted by seungheondoh 4 TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling · 3 authors 8 2
Submitted by taesiri 3 FocusAgent: Simple Yet Effective Ways of Trimming the Large Context of Web Agents · 10 authors 2
Submitted by Norrrrrrr 3 WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents · 5 authors 3 2
Submitted by rvandeghen 3 Triangle Splatting+: Differentiable Rendering with Opaque Triangles · 9 authors 2
Submitted by taesiri 2 Improving GUI Grounding with Explicit Position-to-Coordinate Mapping · 7 authors 2
Submitted by Vivre 2 Consolidating Reinforcement Learning for Multimodal Discrete Diffusion Models · 4 authors 5 2
Submitted by weizhech 2 LSPO: Length-aware Dynamic Sampling for Policy Optimization in LLM Reasoning University of Southern California 0 1
Submitted by hjzheng 2 Continuously Augmented Discrete Diffusion model for Categorical Generative Modeling Apple 2
Submitted by Pamela153 2 A Practitioner's Guide to Multi-turn Agentic Reinforcement Learning PEARLS Lab 1
Submitted by stellalisy 2 Personalized Reasoning: Just-In-Time Personalization and Why LLMs Fail At It University of Washington NLP 2
Submitted by jojo23333 2 Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs University of British Columbia 1
Submitted by cmhungsteve 1 LEAML: Label-Efficient Adaptation to Out-of-Distribution Visual Tasks for Multimodal Large Language Models · 4 authors 1
Submitted by taesiri 1 SpineBench: A Clinically Salient, Level-Aware Benchmark Powered by the SpineMed-450k Corpus · 26 authors 2
Submitted by Madddy 1 Dale meets Langevin: A Multiplicative Denoising Diffusion Model Indian Institute of Science 1
Submitted by ethanning 1 Less LLM, More Documents: Searching for Improved RAG Carnegie Mellon University School of Computer Science 2
Submitted by taesiri 1 How Confident are Video Models? Empowering Video Models to Express their Uncertainty · 3 authors 2 2
Submitted by paulcha1025 1 Align Your Tangent: Training Better Consistency Models via Manifold-Aligned Tangents · 3 authors 2
Submitted by wellbeing 1 DiffTester: Accelerating Unit Test Generation for Diffusion LLMs via Repetitive Pattern · 4 authors 2
Submitted by Yuan-avs - NuRisk: A Visual Question Answering Dataset for Agent-Level Risk Assessment in Autonomous Driving Technical University of Munich 1
Submitted by hpouransari - Pretraining with hierarchical memories: separating long-tail and common knowledge Apple 2
Submitted by josephimperial - Scaling Policy Compliance Assessment in Language Models with Policy Reasoning Traces University of Bath 0 2