2025 ICML ICML 2025

OrthoRank: Token Selection via Sink Token Orthogonality for Efficient LLM inference