Models
Datasets
Spaces
Buckets new
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2603.27027

about 6 hours ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published 17 days ago • 154
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published 16 days ago • 141
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published 17 days ago • 153
LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 14 days ago • 137

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

Paper • 2602.21548 • Published Feb 25 • 49
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published 16 days ago • 141

zbeeb/Hass-MathInstruct_20epochs

Updated 12 days ago
zbeeb/Hass-ShareGPT_20epochs

Updated 12 days ago • 21
zbeeb/Hass-Sharegpt-Mathinstruct-20epochs

Updated 12 days ago • 16
zbeeb/Hass-Averaged-Checkpoint

Updated 12 days ago • 15

run-hardware-opti

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 26 days ago • 307
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published 16 days ago • 141

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220
Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

Paper • 2603.11076 • Published Mar 10 • 5
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published 21 days ago • 77

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8, 2025 • 421 • 99
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 39
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

about 6 hours ago

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published 17 days ago • 154
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published 16 days ago • 141
Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published 17 days ago • 153
LongCat-Next: Lexicalizing Modalities as Discrete Tokens

Paper • 2603.27538 • Published 14 days ago • 137

run-hardware-opti

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published 26 days ago • 307
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published 16 days ago • 141

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

Paper • 2602.21548 • Published Feb 25 • 49
TAPS: Task Aware Proposal Distributions for Speculative Sampling

Paper • 2603.27027 • Published 16 days ago • 141

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 220
Flash-KMeans: Fast and Memory-Efficient Exact K-Means

Paper • 2603.09229 • Published Mar 10 • 82
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

Paper • 2603.11076 • Published Mar 10 • 5
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

Paper • 2603.21065 • Published 21 days ago • 77

zbeeb/Hass-MathInstruct_20epochs

Updated 12 days ago
zbeeb/Hass-ShareGPT_20epochs

Updated 12 days ago • 21
zbeeb/Hass-Sharegpt-Mathinstruct-20epochs

Updated 12 days ago • 16
zbeeb/Hass-Averaged-Checkpoint

Updated 12 days ago • 15

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8, 2025 • 421 • 99
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12, 2025 • 39
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 97
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23, 2025 • 88

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs