Submitted by Nothing2Say 95 VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models · 7 authors 2
Submitted by taesiri 76 SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines · 32 authors 37 2
Submitted by Sicong 67 MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources · 15 authors 180 2
Submitted by wujie10 45 Seedream 4.0: Toward Next-generation Multimodal Image Generation · 50 authors 6
Submitted by taesiri 26 Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets · 19 authors 3
Submitted by qianlanwyd 17 TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them · 14 authors 2
Submitted by Suu 14 CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning · 8 authors 6 4
Submitted by hyz317 11 CHARM: Control-point-based 3D Anime Hairstyle Auto-Regressive Modeling · 9 authors 10 2
Submitted by Shilin-LU 10 Does FLUX Already Know How to Perform Physically Plausible Image Composition? · 6 authors 2
Submitted by chengle 10 Recon-Act: A Self-Evolving Multi-Agent Browser-Use System via Web Reconnaissance, Tool Generation, and Task Execution · 4 authors 804 2
Submitted by MingLiiii 9 Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld's Episode Theory · 9 authors 2
Submitted by QizhiPei 8 ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning · 9 authors 3 2
Submitted by CSJianYang 8 V-GameGym: Visual Game Generation for Code Large Language Models · 12 authors 2
Submitted by taesiri 7 SD3.5-Flash: Distribution-Guided Distillation of Generative Flows Stability AI 2
Submitted by TangJiakai5704 5 Interactive Recommendation Agent with Active User Commands · 15 authors 2
Submitted by augustinLib 5 BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback · 4 authors 9 2
Submitted by chengq9 5 UserRL: Training Interactive User-Centric Agent via Reinforcement Learning · 13 authors 2
Submitted by Jungang 4 MOSS-ChatV: Reinforcement Learning with Process Reasoning Reward for Video Temporal Reasoning · 11 authors 2
Submitted by lx865712528 4 Behind RoPE: How Does Causal Mask Encode Positional Information? · 6 authors 2
Submitted by penfever 4 When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity · 5 authors 3
Submitted by taesiri 4 SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent · 4 authors 2
Submitted by prateekv 3 Thinking While Listening: Simple Test Time Scaling For Audio Classification · 2 authors 2
Submitted by TianheWu 2 The Unanticipated Asymmetry Between Perceptual Optimization and Assessment · 5 authors 1 2
Submitted by pengxiang 2 Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving · 9 authors 2 2
Submitted by huzaifas-sidhpurwala 2 Blueprints of Trust: AI System Cards for End to End Transparency and Governance · 5 authors 1 2
Submitted by taesiri 1 StyleBench: Evaluating thinking styles in Large Language Models · 5 authors 1 2
Submitted by dlion168 1 MI-Fuse: Label Fusion for Unsupervised Domain Adaptation with Closed-Source Large-Audio Language Model · 3 authors 2
Submitted by zx1239856 - OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps · 10 authors 2