Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Application Areas
Machine Learning
›
Application Areas
›
Model Compression
1503 directly classified papers
Papers per year
2006: 2
2010: 2
2011: 1
2013: 5
2014: 3
2015: 4
2016: 3
2017: 14
2018: 36
2019: 55
2020: 117
2021: 171
2022: 172
2023: 175
2024: 331
2025: 402
2026: 10
Papers
MSQ: Memory-Efficient Bit Sparsification Quantization
ICCV 2025
500xCompressor: Generalized Prompt Compression for Large Language Models
ACL 2025
Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors
IJCAI 2025
“Give Me BF16 or Give Me Death”? Accuracy-Performance Trade-Offs in LLM Quantization
ACL 2025
CSPLADE: Learned Sparse Retrieval with Causal Language Models
IJCNLP 2025
AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model
ICCV 2025
MUNBa: Machine Unlearning via Nash Bargaining
ICCV 2025
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
COLING 2025
Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings
COLING 2025
Efficient Vocabulary Reduction for Small Language Models
COLING 2025
VRCP: Vocabulary Replacement Continued Pretraining for Efficient Multilingual Language Models
COLING 2025
DadmaTools V2: an Adapter-Based Natural Language Processing Toolkit for the Persian Language
COLING 2025
AAIG at GenAI Detection Task 1: Exploring Syntactically-Aware, Resource-Efficient Small Autoregressive Decoders for AI Content Detection
COLING 2025
How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?
NAACL 2025
As easy as PIE: understanding when pruning causes language models to disagree
NAACL 2025
LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models
NAACL 2025
Large Language Models Are Overparameterized Text Encoders
NAACL 2025
GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs
ACL 2025
QPruner: Probabilistic Decision Quantization for Structured Pruning in Large Language Models
NAACL 2025
SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models
NAACL 2025
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM
ICCV 2025
Quaff: Quantized Parameter-Efficient Fine-Tuning under Outlier Spatial Stability Hypothesis
ACL 2025
TAGCOS: Task-agnostic Gradient Clustered Coreset Selection for Instruction Tuning Data
NAACL 2025
Boost Embodied AI Models with Robust Compression Boundary
IJCAI 2025
Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation
NAACL 2025
<
1
2
3
4
5
…
61
>