M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models https://arxiv.org/abs/2504.10449
Junxiong Wang PRO
JunxiongWang
AI & ML interests
Attention Free Model / Subquadratic Language Models
Recent Activity
upvoted
an
article
7 days ago
Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models
updated
a model
3 months ago
JunxiongWang/M1-3B
updated
a model
4 months ago
togethercomputer/M1-3B