Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Application Areas
Machine Learning
›
Application Areas
›
Model Compression
1503 directly classified papers
Papers per year
2006: 2
2010: 2
2011: 1
2013: 5
2014: 3
2015: 4
2016: 3
2017: 14
2018: 36
2019: 55
2020: 117
2021: 171
2022: 172
2023: 175
2024: 331
2025: 402
2026: 10
Papers
Single-step Diffusion for Image Compression at Ultra-Low Bitrates
WACV 2026
Efficient Text-Guided Convolutional Adapter for the Diffusion Model
WACV 2026
FIRM-MoE:Fine-GrainedExpert Decomposition for Resource-Adaptive MoE Inference
AAAI 2026
OTARo: Once Tuning for All Precisions Toward Robust On-Device LLMs
AAAI 2026
GFT: Graph Feature Tuning for Efficient Point Cloud Analysis
WACV 2026
One-Cycle Structured Pruning via Stability-Driven Subnetwork Search
WACV 2026
Beyond Real Weights: Hypercomplex Representations for Stable Quantization
WACV 2026
TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression
WACV 2026
DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation
AAAI 2026
Prune&Comp: Free Lunch for Layer-Pruned LLMs via Iterative Pruning with Magnitude Compensation
AAAI 2026
Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors
IJCAI 2025
EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models
ACL 2025
500xCompressor: Generalized Prompt Compression for Large Language Models
ACL 2025
“Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization
ACL 2025
SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models
NAACL 2025
QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models
NAACL 2025
LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models
NAACL 2025
LangCompress: Language-Aware Compression of Large Language Models
IJCNLP 2025
Interpreting the Effects of Quantization on LLMs
IJCNLP 2025
AAIG at GenAI Detection Task 1: Exploring Syntactically-Aware, Resource-Efficient Small Autoregressive Decoders for AI Content Detection
COLING 2025
GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs
ACL 2025
Multilingual Iterative Model Pruning: What Matters?
IJCNLP 2025
Quaff: Quantized Parameter-Efficient Fine-Tuning under Outlier Spatial Stability Hypothesis
ACL 2025
Boost Embodied AI Models with Robust Compression Boundary
IJCAI 2025
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
COLING 2025
<
1
2
3
4
5
…
61
>