2024 ICML ICML 2024

LoRAP: Transformer Sub-Layers Deserve Differentiated Structured Compression for Large Language Models