Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
PEFTDiff: Diffusion-Guided Transferability Estimation for Parameter-Efficient Fine-Tuning
ICCV 2025
QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models
NAACL 2025
TRNAS: A Training-Free Robust Neural Architecture Search
ICCV 2025
LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models
NAACL 2025
Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection
ICCV 2025
MoLA: MoE LoRA with Layer-wise Expert Allocation
NAACL 2025
Dropping Experts, Recombining Neurons: Retraining-Free Pruning for Sparse Mixture-of-Experts LLMs
EMNLP 2025
Avoiding Copyright Infringement via Large Language Model Unlearning
NAACL 2025
FAEDKV: Infinite-Window Fourier Transform for Unbiased KV Cache Compression
EMNLP 2025
MLKV: Multi-Layer Key-Value Heads for Memory Efficient Transformer Decoding
NAACL 2025
Sparsifying Mamba
EMNLP 2025
RankAdaptor: Hierarchical Rank Allocation for Efficient Fine-Tuning Pruned LLMs via Performance Model
NAACL 2025
A Pipeline to Assess Merging Methods via Behavior and Internals
EMNLP 2025
As easy as PIE: understanding when pruning causes language models to disagree
NAACL 2025
SpindleKV: A Novel KV Cache Reduction Method Balancing Both Shallow and Deep Layers
ACL 2025
UNLEARN Efficient Removal of Knowledge in Large Language Models
NAACL 2025
Localized Concept Erasure for Text-to-Image Diffusion Models Using Training-Free Gated Low-Rank Adaptation
CVPR 2025
Unlocking the Potential of Lightweight Quantized Models for Deepfake Detection
IJCAI 2025
Rethinking Removal Attack and Fingerprinting Defense for Model Intellectual Property Protection: A Frequency Perspective
IJCAI 2025
Binary Event-Driven Spiking Transformer
IJCAI 2025
EnergyCompress: A General Case Base Learning Strategy
IJCAI 2025
Block Circulant Adapter for Large Language Models
IJCAI 2025
Not All Layers of LLMs Are Necessary During Inference
IJCAI 2025
Integrating Independent Layer-Wise Rank Selection with Low-Rank SVD Training for Model Compression: A Theory-Driven Approach
IJCAI 2025
ESC: Erasing Space Concept for Knowledge Deletion
CVPR 2025
<
1
…
9
10
11
…
67
>