2025 ICML ICML 2025

SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models