Research Explorer
Model Compression
1503 directly classified papers
Papers per year
2006: 2
2010: 2
2011: 1
2013: 5
2014: 3
2015: 4
2016: 3
2017: 14
2018: 36
2019: 55
2020: 117
2021: 171
2022: 172
2023: 175
2024: 331
2025: 402
2026: 10
Papers
EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models
ACL 2025
StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data
ICCV 2025
Beyond Low-Rank Tuning: Model Prior-Guided Rank Allocation for Effective Transfer in Low-Data and Large-Gap Regimes
ICCV 2025
Dense2MoE: Restructuring Diffusion Transformer to MoE for Efficient Text-to-Image Generation
ICCV 2025
RED-CT: A Systems Design Methodology for Using LLM-labeled Data to Train and Deploy Edge Linguistic Classifiers
COLING 2025
Best Practices for Distilling Large Language Models into BERT for Web Search Ranking
COLING 2025
Lightweight Safety Guardrails Using Fine-tuned BERT Embeddings
COLING 2025
Efficient Vocabulary Reduction for Small Language Models
COLING 2025
DadmaTools V2: an Adapter-Based Natural Language Processing Toolkit for the Persian Language
COLING 2025
AAIG at GenAI Detection Task 1: Exploring Syntactically-Aware, Resource-Efficient Small Autoregressive Decoders for AI Content Detection
COLING 2025
Extracting General-use Transformers for Low-resource Languages via Knowledge Distillation
COLING 2025
VRCP: Vocabulary Replacement Continued Pretraining for Efficient Multilingual Language Models
COLING 2025
Variance-Based Pruning for Accelerating and Compressing Trained Networks
ICCV 2025
Outlier-Aware Post-Training Quantization for Image Super-Resolution
ICCV 2025
Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation
NAACL 2025
LangCompress: Language-Aware Compression of Large Language Models
IJCNLP 2025
General Compression Framework for Efficient Transformer Object Tracking
ICCV 2025
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM
ICCV 2025
Interpreting the Effects of Quantization on LLMs
IJCNLP 2025
AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model
ICCV 2025
Aligning Sizes of Intermediate Layers by LoRA Adapter for Knowledge Distillation
NAACL 2025
As easy as PIE: understanding when pruning causes language models to disagree
NAACL 2025
Verifiable Format Control for Large Language Model Generations
NAACL 2025
SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models
NAACL 2025
LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models
NAACL 2025