2025 ICML ICML 2025

MVA: Linear Attention with High-order Query-Keys Integration and Multi-level Vocabulary Decomposition