SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling Paper • 2504.08719 • Published Apr 11
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models Paper • 2504.03624 • Published Apr 4 • 13
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 55
RULER: What's the Real Context Size of Your Long-Context Language Models? Paper • 2404.06654 • Published Apr 9, 2024 • 39
Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition Paper • 2210.03255 • Published Oct 6, 2022 • 1
Every child should have parents: a taxonomy refinement algorithm based on hyperbolic term embeddings Paper • 1906.02002 • Published Jun 5, 2019 • 1