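Mamba is a sequence model built on selective state spaces, in which the state-space parameters depend on the input. Its compute scales linearly with sequence length, which makes it more efficient than Transformers on long sequences. The paper reports state-of-the-art results across language, audio, and genomics, along with roughly 5× higher inference throughput than Transformers.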
https://arxiv.org/abs/2312.00752
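
To make "linear-time" and "selective" concrete, here is a minimal sketch of a single-channel selective state space recurrence in pure Python. It is not the paper's implementation: the real model uses learned projections over full token vectors and a hardware-aware parallel scan, while this toy version simply loops over the sequence. The parameter names (`w_delta`, `b_delta`, `w_B`, `w_C`) and the simplified discretization (step size times B) are assumptions made for illustration.

```python
import math


def softplus(z):
    # Numerically stable log(1 + exp(z)); keeps the step size positive.
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def selective_scan(xs, A, w_delta, b_delta, w_B, w_C):
    """Toy single-channel selective SSM, run as a sequential recurrence.

    xs               : list of floats, the input sequence (length L)
    A                : list of negative floats, diagonal state matrix (length N)
    w_delta, b_delta : hypothetical scalars giving the input-dependent step size
    w_B, w_C         : hypothetical lists (length N) giving input-dependent B_t, C_t
    """
    n = len(A)
    h = [0.0] * n  # fixed-size hidden state, independent of sequence length
    ys = []
    for x in xs:  # a single pass over the sequence: O(L * N) work
        # "Selective": step size, B_t and C_t are all functions of the input.
        delta = softplus(w_delta * x + b_delta)
        for i in range(n):
            a_bar = math.exp(delta * A[i])  # discretized state transition
            b_bar = delta * (w_B[i] * x)    # simplified discretized input matrix
            h[i] = a_bar * h[i] + b_bar * x
        ys.append(sum((w_C[i] * x) * h[i] for i in range(n)))
    return ys


ys = selective_scan(
    [1.0, 0.0, 0.0],
    A=[-1.0, -2.0],
    w_delta=1.0,
    b_delta=0.0,
    w_B=[1.0, 1.0],
    w_C=[1.0, 1.0],
)
```

Because the hidden state `h` has a fixed size and each token is visited once, the cost grows linearly with the sequence length, in contrast to the quadratic cost of full self-attention.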