Research Explorer
Model Compression
1503 directly classified papers
Papers per year
2006: 2
2010: 2
2011: 1
2013: 5
2014: 3
2015: 4
2016: 3
2017: 14
2018: 36
2019: 55
2020: 117
2021: 171
2022: 172
2023: 175
2024: 331
2025: 402
2026: 10
Papers
DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization
ICCV 2025
VRCP: Vocabulary Replacement Continued Pretraining for Efficient Multilingual Language Models
COLING 2025
Scaling Action Detection: AdaTAD++ with Transformer-Enhanced Temporal-Spatial Adaptation
ICCV 2025
Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer
ACL 2025
MobiLoRA: Accelerating LoRA-based LLM Inference on Mobile Devices via Context-aware KV Cache Optimization
ACL 2025
Scaling Laws and Efficient Inference for Ternary Language Models
ACL 2025
Balcony: A Lightweight Approach to Dynamic Inference of Generative Language Models
EMNLP 2025
ImPart: Importance-Aware Delta-Sparsification for Improved Model Compression and Merging in LLMs
ACL 2025
ZipVL: Accelerating Vision-Language Models through Dynamic Token Sparsity
ICCV 2025
LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering
ACL 2025
Grouped Speculative Decoding for Autoregressive Image Generation
ICCV 2025
Efficient Federated Learning via Clients-to-Server Knowledge Distillation (Student Abstract)
AAAI 2025
FOLDER: Accelerating Multi-Modal Large Language Models with Enhanced Performance
ICCV 2025
Multilingual Iterative Model Pruning: What Matters?
IJCNLP 2025
LoRA-FAIR: Federated LoRA Fine-Tuning with Aggregation and Initialization Refinement
ICCV 2025
Controlled Low-Rank Adaptation with Subspace Regularization for Continued Training on Large Language Models
ACL 2025
AIRA: Activation-Informed Low-Rank Adaptation for Large Models
ICCV 2025
Multi-Attribute Steering of Language Models via Targeted Intervention
ACL 2025
QuantCache: Adaptive Importance-Guided Quantization with Hierarchical Latent and Layer Caching for Video Generation
ICCV 2025
Maximizing the Effectiveness of Larger BERT Models for Compression
ACL 2025
TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning
ICCV 2025
LangCompress: Language-Aware Compression of Large Language Models
IJCNLP 2025
Staining and Locking Computer Vision Models Without Retraining
ICCV 2025
AnalyticKWS: Towards Exemplar-Free Analytic Class Incremental Learning for Small-footprint Keyword Spotting
ACL 2025
APHQ-ViT: Post-Training Quantization with Average Perturbation Hessian Based Reconstruction for Vision Transformers
CVPR 2025