-
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 78 -
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 229 -
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171 • Published • 156 -
Pretraining Language Models to Ponder in Continuous Space
Paper • 2505.20674 • Published • 3
Collections
Discover the best community collections!
Collections including paper arxiv:2605.14386
-
Depth Anything V2
Paper • 2406.09414 • Published • 103 -
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
Paper • 2406.09415 • Published • 51 -
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
Paper • 2406.04338 • Published • 39 -
SAM 2: Segment Anything in Images and Videos
Paper • 2408.00714 • Published • 122
-
Refusal in Language Models Is Mediated by a Single Direction
Paper • 2406.11717 • Published • 13 -
Self-Distilled Agentic Reinforcement Learning
Paper • 2605.15155 • Published • 108 -
MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models
Paper • 2605.14906 • Published • 73 -
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
Paper • 2605.15128 • Published • 60
-
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance
Paper • 2511.13254 • Published • 140 -
Token-Level LLM Collaboration via FusionRoute
Paper • 2601.05106 • Published • 40 -
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning
Paper • 2605.14386 • Published • 59
-
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 57 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 37 -
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Paper • 2411.14432 • Published • 25
-
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Paper • 2603.12180 • Published • 65 -
Flow-OPD: On-Policy Distillation for Flow Matching Models
Paper • 2605.08063 • Published • 97 -
Normalizing Trajectory Models
Paper • 2605.08078 • Published • 14 -
STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation
Paper • 2605.08029 • Published • 11
-
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation
Paper • 2603.15132 • Published • 35 -
Long Context Pre-Training with Lighthouse Attention
Paper • 2605.06554 • Published • 27
-
Continuous Latent Diffusion Language Model
Paper • 2605.06548 • Published • 78 -
Scaling Latent Reasoning via Looped Language Models
Paper • 2510.25741 • Published • 229 -
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach
Paper • 2502.05171 • Published • 156 -
Pretraining Language Models to Ponder in Continuous Space
Paper • 2505.20674 • Published • 3
-
Depth Anything V2
Paper • 2406.09414 • Published • 103 -
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels
Paper • 2406.09415 • Published • 51 -
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion
Paper • 2406.04338 • Published • 39 -
SAM 2: Segment Anything in Images and Videos
Paper • 2408.00714 • Published • 122
-
Contrastive Decoding Improves Reasoning in Large Language Models
Paper • 2309.09117 • Published • 40 -
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 57 -
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding
Paper • 2411.04282 • Published • 37 -
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Paper • 2411.14432 • Published • 25
-
Refusal in Language Models Is Mediated by a Single Direction
Paper • 2406.11717 • Published • 13 -
Self-Distilled Agentic Reinforcement Learning
Paper • 2605.15155 • Published • 108 -
MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models
Paper • 2605.14906 • Published • 73 -
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory
Paper • 2605.15128 • Published • 60
-
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections
Paper • 2603.12180 • Published • 65 -
Flow-OPD: On-Policy Distillation for Flow Matching Models
Paper • 2605.08063 • Published • 97 -
Normalizing Trajectory Models
Paper • 2605.08078 • Published • 14 -
STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation
Paper • 2605.08029 • Published • 11
-
Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance
Paper • 2511.13254 • Published • 140 -
Token-Level LLM Collaboration via FusionRoute
Paper • 2601.05106 • Published • 40 -
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning
Paper • 2605.14386 • Published • 59
-
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
Paper • 2511.07384 • Published • 19 -
Believe Your Model: Distribution-Guided Confidence Calibration
Paper • 2603.03872 • Published • 40 -
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation
Paper • 2603.15132 • Published • 35 -
Long Context Pre-Training with Lighthouse Attention
Paper • 2605.06554 • Published • 27