Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Core AI
Artificial Intelligence
›
Core AI
›
Model Compression
1928 directly classified papers
Papers per year
2013: 2
2014: 1
2015: 6
2016: 4
2017: 13
2018: 47
2019: 81
2020: 114
2021: 172
2022: 191
2023: 272
2024: 370
2025: 489
2026: 166
Papers
Rate Distortion For Model Compression:From Theory To Practice
ICML 2019
Binarized Neural Networks for Resource-Efficient Hashing with Minimizing Quantization Loss
IJCAI 2019
Cooperative Pruning in Cross-Domain Deep Neural Network Compression
IJCAI 2019
Play and Prune: Adaptive Filter Pruning for Deep Model Compression
IJCAI 2019
COP: Customized Deep Model Compression via Regularized Correlation-Based Filter-Level Pruning
IJCAI 2019
Compression of End-to-End Models
INTERSPEECH 2018
Dynamically Hierarchy Revolution: DirNet for Compressing Recurrent Neural Network on Mobile Devices
IJCAI 2018
Improving Deep Neural Network Sparsity through Decorrelation Regularization
IJCAI 2018
Compact Personalized Models for Neural Machine Translation
EMNLP 2018
OpenNMT System Description for WNMT 2018: 800 words/sec on a single-core CPU
ACL 2018
TVM: An Automated End-to-End Optimizing Compiler for Deep Learning
OSDI 2018
WSNet: Compact and Efficient Networks Through Weight Sampling
ICML 2018
A Simple Cache Model for Image Recognition
NIPS 2018
Verifiable Reinforcement Learning via Policy Extraction
NIPS 2018
Scalable methods for 8-bit training of neural networks
NIPS 2018
Marian: Cost-effective High-Quality Neural Machine Translation in C++
ACL 2018
FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network
NIPS 2018
GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model Shrinking
NIPS 2018
Scaling provable adversarial defenses
NIPS 2018
PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning
CVPR 2018
Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions
CVPR 2018
SecureNets: Secure Inference of Deep Neural Networks on an Untrusted Cloud
ACML 2018
Towards Binary-Valued Gates for Robust LSTM Training
ICML 2018
Efficient DNN Neuron Pruning by Minimizing Layer-wise Nonlinear Reconstruction Error
IJCAI 2018
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
JMLR 2018
<
1
…
74
75
76
77
78
>