Research Explorer
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
Attend to Not Attended: Structure-then-Detail Token Merging for Post-training DiT Acceleration
CVPR 2025
DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization
ICCV 2025
GAP: a Global Adaptive Pruning Method for Large Language Models
EMNLP 2025
Less is More: Empowering GUI Agent with Context-Aware Simplification
ICCV 2025
AHCPTQ: Accurate and Hardware-Compatible Post-Training Quantization for Segment Anything Model
ICCV 2025
Inference-Time Diffusion Model Distillation
ICCV 2025
Lightweight and Fast Real-time Image Enhancement via Decomposition of the Spatial-aware Lookup Tables
ICCV 2025
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration
ICCV 2025
TCFG: Truncated Classifier-Free Guidance for Efficient and Scalable Text-to-Image Acceleration
ICCV 2025
Efficient Input-level Backdoor Defense on Text-to-Image Synthesis via Neuron Activation Variation
ICCV 2025
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective Layers
ICCV 2025
A Good Teacher Adapts Their Knowledge for Distillation
ICCV 2025
Pruning All-Rounder: Rethinking and Improving Inference Efficiency for Large Vision Language Models
ICCV 2025
Accelerating Diffusion Transformer via Gradient-Optimized Cache
ICCV 2025
WINS: Winograd Structured Pruning for Fast Winograd Convolution
ICCV 2025
BlueLM-V-3B: Algorithm and System Co-Design for Multimodal Large Language Models on Mobile Devices
CVPR 2025
Efficient On-Device Text Simplification for Firefox with Synthetic Data Fine-Tuning
EMNLP 2025
ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning Models
EMNLP 2025
SwiftPrune: Hessian-Free Weight Pruning for Large Language Models
EMNLP 2025
Fine-tuning LLMs with Cross-Attention-based Weight Decay for Bias Mitigation
EMNLP 2025
Vicomtech@WMT 2025: Evolutionary Model Compression for Machine Translation
EMNLP 2025
FroM: Frobenius Norm-Based Data-Free Adaptive Model Merging
EMNLP 2025
Train It and Forget It: Merge Lists are Unnecessary for BPE Inference in Language Models
EMNLP 2025
Harmonizing Diverse Models: A Layer-wise Merging Strategy for Consistent Generation
EMNLP 2025
Multi-Task Pre-Finetuning of Lightweight Transformer Encoders for Text Classification and NER
EMNLP 2025