2024 ICML ICML 2024

Short-Long Convolutions Help Hardware-Efficient Linear Attention to Focus on Long Sequences