Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
- Website
- Community
- Solutions
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2605.14386

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 16 days ago • 78
Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 156
Pretraining Language Models to Ponder in Continuous Space

Paper • 2505.20674 • Published May 27, 2025 • 3

Depth Anything V2

Paper • 2406.09414 • Published Jun 13, 2024 • 103
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Paper • 2406.09415 • Published Jun 13, 2024 • 51
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion

Paper • 2406.04338 • Published Jun 6, 2024 • 39
SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 122

Sol Interactions

Refusal in Language Models Is Mediated by a Single Direction

Paper • 2406.11717 • Published Jun 17, 2024 • 13
Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 9 days ago • 108
MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

Paper • 2605.14906 • Published 9 days ago • 73
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Paper • 2605.15128 • Published 9 days ago • 60

Model Arithmetic

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17, 2025 • 140
Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

Paper • 2605.14386 • Published 9 days ago • 59

비드래프트

about 2 hours ago

FINAL-Bench/Metacognitive

Viewer • Updated Feb 27 • 100 • 899 • 90
Running

Featured

50

Leaderboard - FINAL Bench 'Metacognitive'

🚀

50

Metacognitive
Running

79

ALL Bench Leaderboard

🚀

79

ALL Bench Leaderboard
FINAL-Bench/Darwin-4B-Genesis

Text Generation • 8B • Updated 7 days ago • 463 • 33

about 14 hours ago

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 40
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 57
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 37
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 25

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65
Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published 15 days ago • 97
Normalizing Trajectory Models

Paper • 2605.08078 • Published 15 days ago • 14
STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation

Paper • 2605.08029 • Published 15 days ago • 11

LLM Training Methodologies

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19
Believe Your Model: Distribution-Guided Confidence Calibration

Paper • 2603.03872 • Published Mar 4 • 40
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation

Paper • 2603.15132 • Published Mar 16 • 35
Long Context Pre-Training with Lighthouse Attention

Paper • 2605.06554 • Published 16 days ago • 27

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models.

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 16 days ago • 78
Scaling Latent Reasoning via Looped Language Models

Paper • 2510.25741 • Published Oct 29, 2025 • 229
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 156
Pretraining Language Models to Ponder in Continuous Space

Paper • 2505.20674 • Published May 27, 2025 • 3

비드래프트

about 2 hours ago

FINAL-Bench/Metacognitive

Viewer • Updated Feb 27 • 100 • 899 • 90
Running

Featured

50

Leaderboard - FINAL Bench 'Metacognitive'

🚀

50

Metacognitive
Running

79

ALL Bench Leaderboard

🚀

79

ALL Bench Leaderboard
FINAL-Bench/Darwin-4B-Genesis

Text Generation • 8B • Updated 7 days ago • 463 • 33

Depth Anything V2

Paper • 2406.09414 • Published Jun 13, 2024 • 103
An Image is Worth More Than 16x16 Patches: Exploring Transformers on Individual Pixels

Paper • 2406.09415 • Published Jun 13, 2024 • 51
Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion

Paper • 2406.04338 • Published Jun 6, 2024 • 39
SAM 2: Segment Anything in Images and Videos

Paper • 2408.00714 • Published Aug 1, 2024 • 122

about 14 hours ago

Contrastive Decoding Improves Reasoning in Large Language Models

Paper • 2309.09117 • Published Sep 17, 2023 • 40
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 57
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 37
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published Nov 21, 2024 • 25

Sol Interactions

Refusal in Language Models Is Mediated by a Single Direction

Paper • 2406.11717 • Published Jun 17, 2024 • 13
Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 9 days ago • 108
MemLens: Benchmarking Multimodal Long-Term Memory in Large Vision-Language Models

Paper • 2605.14906 • Published 9 days ago • 73
MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Paper • 2605.15128 • Published 9 days ago • 60

Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections

Paper • 2603.12180 • Published Mar 12 • 65
Flow-OPD: On-Policy Distillation for Flow Matching Models

Paper • 2605.08063 • Published 15 days ago • 97
Normalizing Trajectory Models

Paper • 2605.08078 • Published 15 days ago • 14
STARFlow2: Bridging Language Models and Normalizing Flows for Unified Multimodal Generation

Paper • 2605.08029 • Published 15 days ago • 11

Model Arithmetic

Souper-Model: How Simple Arithmetic Unlocks State-of-the-Art LLM Performance

Paper • 2511.13254 • Published Nov 17, 2025 • 140
Token-Level LLM Collaboration via FusionRoute

Paper • 2601.05106 • Published Jan 8 • 40
Darwin Family: MRI-Trust-Weighted Evolutionary Merging for Training-Free Scaling of Language-Model Reasoning

Paper • 2605.14386 • Published 9 days ago • 59

LLM Training Methodologies

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Paper • 2511.07384 • Published Nov 10, 2025 • 19
Believe Your Model: Distribution-Guided Confidence Calibration

Paper • 2603.03872 • Published Mar 4 • 40
WiT: Waypoint Diffusion Transformers via Trajectory Conflict Navigation

Paper • 2603.15132 • Published Mar 16 • 35
Long Context Pre-Training with Lighthouse Attention

Paper • 2605.06554 • Published 16 days ago • 27

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs