OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild Paper • 2511.08423 • Published 29 days ago • 1 • 1
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning Paper • 2512.07461 • Published 2 days ago • 60 • 2
Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs Paper • 2512.07525 • Published 2 days ago • 51 • 2
Decouple to Generalize: Context-First Self-Evolving Learning for Data-Scarce Vision-Language Reasoning Paper • 2512.06835 • Published 3 days ago • 3 • 2
VG-Refiner: Towards Tool-Refined Referring Grounded Reasoning via Agentic Reinforcement Learning Paper • 2512.06373 • Published 4 days ago • 8 • 3
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models Paper • 2512.07783 • Published 2 days ago • 24 • 2
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation Paper • 2512.07831 • Published 2 days ago • 15 • 3
Embodied Referring Expression Comprehension in Human-Robot Interaction Paper • 2512.06558 • Published 4 days ago • 1 • 2
OmniSafeBench-MM: A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack-Defense Evaluation Paper • 2512.06589 • Published 4 days ago • 16 • 2
ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation Paper • 2512.03621 • Published 7 days ago • 8 • 2
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators Paper • 2512.06963 • Published 3 days ago • 2 • 2
The SAM2-to-SAM3 Gap in the Segment Anything Model Family: Why Prompt-Based Expertise Fails in Concept-Driven Image Segmentation Paper • 2512.06032 • Published 6 days ago • 2