2024 ICML ICML 2024

SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks