Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation Paper β’ 2507.01957 β’ Published 29 days ago β’ 19
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity Paper β’ 2506.16500 β’ Published Jun 19 β’ 17
SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity Paper β’ 2506.16500 β’ Published Jun 19 β’ 17
Radial Attention: $O(n\log n)$ Sparse Attention with Energy Decay for Long Video Generation Paper β’ 2506.19852 β’ Published Jun 24 β’ 40
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer Paper β’ 2303.17605 β’ Published Mar 30, 2023
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing Paper β’ 2005.14187 β’ Published May 28, 2020 β’ 2
MapPrior: Bird's-Eye View Map Layout Estimation with Generative Models Paper β’ 2308.12963 β’ Published Aug 24, 2023
FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer Paper β’ 2301.08739 β’ Published Jan 20, 2023
LongVILA: Scaling Long-Context Visual Language Models for Long Videos Paper β’ 2408.10188 β’ Published Aug 19, 2024 β’ 53