Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Paper • 2502.13145 • Published Feb 18 • 38
DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving Paper • 2411.15139 • Published Nov 22, 2024 • 15
VAD: Vectorized Scene Representation for Efficient Autonomous Driving Paper • 2303.12077 • Published Mar 21, 2023
You Only Look at One Sequence: Rethinking Transformer in Vision through Object Detection Paper • 2106.00666 • Published Jun 1, 2021
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model Paper • 2401.09417 • Published Jan 17, 2024 • 62
ViG: Linear-complexity Visual Sequence Learning with Gated Linear Attention Paper • 2405.18425 • Published May 28, 2024
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving Paper • 2410.22313 • Published Oct 29, 2024