Submitted by Vasily 80 When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA AIRI - Artificial Intelligence Research Institute 5 2
Submitted by dongguanting 78 Agentic Entropy-Balanced Policy Optimization Renmin University of China 678 4
Submitted by taesiri 65 WithAnyone: Towards Controllable and ID Consistent Image Generation StepFun 102 3
Submitted by zichenwen 60 AI for Service: Proactive Assistance with AI Glasses Shanghai Jiao Tong University 2
Submitted by Paranioar 51 From Pixels to Words -- Towards Native Vision-Language Primitives at Scale SenseTime 49 2
Submitted by xiaochonglinghu 46 ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints AMAP-ML 46 2
Submitted by KID-22 30 Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Ant Group 1 2
Submitted by Keven16 30 LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Tencent Hunyuan 5 2
Submitted by pengyunie 26 TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar University of Waterloo 4 2
Submitted by mukul54 24 Attention Is All You Need for KV Cache in Diffusion LLMs Mohamed Bin Zayed University of Artificial Intelligence 2
Submitted by taesiri 24 PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model PaddlePaddle 57.9k 4
Submitted by taesiri 15 MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning · 14 authors 6 2
Submitted by CheeryLJH 13 VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning NJU-LINK Lab 14 2
Submitted by kenchan0226 13 Large Language Models Do NOT Really Know What They Don't Know Singapore Management University 2
Submitted by han1997 10 VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation Zhejiang University 2
Submitted by XINLI1997 10 COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes Multimodal Art Projection 0 2
Submitted by XINLI1997 9 Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures ByteDance Seed 0 2
Submitted by jyhong836 9 LLMs Can Get "Brain Rot"! Visual Informatics Group @ University of Texas at Austin 7 2
Submitted by bclavie 8 Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Mixedbread 2
Submitted by shenweijie 8 Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning · 13 authors 2
Submitted by MilaWang 8 LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild · 10 authors 2
Submitted by Lakonik 5 pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation Adobe 34 2
Submitted by DaYin 5 LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training UCLA NLP 2
Submitted by hk 5 DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation UCLA NLP 3 2
Submitted by jiwonsong 5 LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning Seoul National University 0 2
Submitted by HJGO 5 VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator · 6 authors 23 2
Submitted by stefan-it 4 The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models CORAL NLP Research 3 2
Submitted by JonasGeiping 3 Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models ELLIS Institute Tübingen 833 2
Submitted by kylemontgomery 2 Budget-aware Test-time Scaling via Discriminative Verification · 7 authors 1 2
Submitted by Robot2050 2 MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems · 6 authors 2
Submitted by SP2001 2 Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms · 7 authors 2
Submitted by shaoweiliu 1 Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation Snapchat Inc. 5 2
Submitted by kylemontgomery 1 Predicting Task Performance with Context-aware Scaling Laws · 7 authors 1 2
Submitted by augustus2011 1 Beyond One World: Benchmarking Super Heros in Role-Playing Across Multiversal Contexts Character-lab 1 2
Submitted by awni00 1 Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning · 4 authors 0 2
Submitted by zhangchen1991 1 RAGCap-Bench: Benchmarking Capabilities of LLMs in Agentic Retrieval Augmented Generation Systems National University of Singapore 2
Submitted by wimmerth 1 AnyUp: Universal Feature Upsampling Max Planck Institute for Informatics 186 2
Submitted by aashiqmuhamed 1 RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models Amazon AGI 2
Submitted by kedaxiaoqiu 1 SCas4D: Structural Cascaded Optimization for Boosting Persistent 4D Novel View Synthesis University of Illinois at Urbana-Champaign 2
Submitted by ZYao720 - GroundedPRM: Tree-Guided and Fidelity-Aware Process Reward Modeling for Step-Level Reasoning Ludwig Maximilian University of Munich 2
Submitted by NickNickGo - Mirror Speculative Decoding: Breaking the Serial Barrier in LLM Inference Apple 2
Submitted by qiranzou - FML-bench: A Benchmark for Automatic ML Research Agents Highlighting the Importance of Exploration Breadth National University of Singapore 3 2