Model Compression

1674 directly classified papers

Papers

PEFTDiff: Diffusion-Guided Transferability Estimation for Parameter-Efficient Fine-Tuning ICCV 2025

QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models NAACL 2025

TRNAS: A Training-Free Robust Neural Architecture Search ICCV 2025

LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models NAACL 2025

Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection ICCV 2025

MoLA: MoE LoRA with Layer-wise Expert Allocation NAACL 2025

Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs EMNLP 2025

Avoiding Copyright Infringement via Large Language Model Unlearning NAACL 2025

FAEDKV: Infinite-Window Fourier Transform for Unbiased KV Cache Compression EMNLP 2025

MLKV: Multi-Layer Key-Value Heads for Memory Efficient Transformer Decoding NAACL 2025

Sparsifying Mamba EMNLP 2025

RankAdaptor: Hierarchical Rank Allocation for Efficient Fine-Tuning Pruned LLMs via Performance Model NAACL 2025

A Pipeline to Assess Merging Methods via Behavior and Internals EMNLP 2025

As easy as PIE: understanding when pruning causes language models to disagree NAACL 2025

SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers ACL 2025

UNLEARN: Efficient Removal of Knowledge in Large Language Models NAACL 2025

Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation CVPR 2025

Unlocking the Potential of Lightweight Quantized Models for Deepfake Detection IJCAI 2025

Rethinking Removal Attack and Fingerprinting Defense for Model Intellectual Property Protection: A Frequency Perspective IJCAI 2025

Binary Event-Driven Spiking Transformer IJCAI 2025

EnergyCompress: A General Case Base Learning Strategy IJCAI 2025

Block Circulant Adapter for Large Language Models IJCAI 2025

Not All Layers of LLMs Are Necessary During Inference IJCAI 2025

Integrating Independent Layer-Wise Rank Selection with Low-Rank SVD Training for Model Compression: A Theory-Driven Approach IJCAI 2025

ESC: Erasing Space Concept for Knowledge Deletion CVPR 2025