Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Optimization & Theory
Deep Learning
›
Optimization & Theory
›
Model Compression
1674 directly classified papers
Papers per year
2012: 1
2013: 2
2014: 2
2015: 7
2016: 9
2017: 27
2018: 51
2019: 79
2020: 189
2021: 165
2022: 206
2023: 207
2024: 325
2025: 399
2026: 5
Papers
Balanced Sparsity for Efficient DNN Inference on GPU
AAAI 2019
RNN Architecture Learning with Sparse Regularization
EMNLP 2019
Small and Practical BERT Models for Sequence Labeling
EMNLP 2019
Towards Optimal Structured CNN Pruning via Generative Adversarial Learning
CVPR 2019
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
CVPR 2019
Fully Quantized Network for Object Detection
CVPR 2019
Compressing Convolutional Neural Networks via Factorized Convolutional Filters
CVPR 2019
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
CVPR 2019
Learning to Quantize Deep Networks by Optimizing Quantization Intervals With Task Loss
CVPR 2019
HetConv: Heterogeneous Kernel-Based Convolutions for Deep CNNs
CVPR 2019
Composite Binary Decomposition Networks
AAAI 2019
Compressing Recurrent Neural Networks with Tensor Ring for Action Recognition
AAAI 2019
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
CVPR 2019
Centripetal SGD for Pruning Very Deep Convolutional Networks With Complicated Structure
CVPR 2019
Normalization Helps Training of Quantized LSTM
NIPS 2019
MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization
NIPS 2019
Global Sparse Momentum SGD for Pruning Very Deep Neural Networks
NIPS 2019
PowerSGD: Practical Low-Rank Gradient Compression for Distributed Optimization
NIPS 2019
Post training 4-bit quantization of convolutional networks for rapid-deployment
NIPS 2019
Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks
NIPS 2019
Positive-Unlabeled Compression on the Cloud
NIPS 2019
Network Pruning via Transformable Architecture Search
NIPS 2019
Efficient Neural Network Compression
CVPR 2019
Simultaneously Optimizing Weight and Quantizer of Ternary Neural Network Using Truncated Gaussian Approximation
CVPR 2019
Learning Channel-Wise Interactions for Binary Convolutional Neural Networks
CVPR 2019
<
1
…
62
63
64
…
67
>