Artificial Intelligence › Core AI ›

Model Compression

1928 directly classified papers

Papers per year

Papers

Rate Distortion For Model Compression:From Theory To Practice ICML 2019

Binarized Neural Networks for Resource-Efficient Hashing with Minimizing Quantization Loss IJCAI 2019

Cooperative Pruning in Cross-Domain Deep Neural Network Compression IJCAI 2019

Play and Prune: Adaptive Filter Pruning for Deep Model Compression IJCAI 2019

COP: Customized Deep Model Compression via Regularized Correlation-Based Filter-Level Pruning IJCAI 2019

Compression of End-to-End Models INTERSPEECH 2018

Dynamically Hierarchy Revolution: DirNet for Compressing Recurrent Neural Network on Mobile Devices IJCAI 2018

Improving Deep Neural Network Sparsity through Decorrelation Regularization IJCAI 2018

Compact Personalized Models for Neural Machine Translation EMNLP 2018

OpenNMT System Description for WNMT 2018: 800 words/sec on a single-core CPU ACL 2018

TVM: An Automated End-to-End Optimizing Compiler for Deep Learning OSDI 2018

WSNet: Compact and Efficient Networks Through Weight Sampling ICML 2018

A Simple Cache Model for Image Recognition NIPS 2018

Verifiable Reinforcement Learning via Policy Extraction NIPS 2018

Scalable methods for 8-bit training of neural networks NIPS 2018

Marian: Cost-effective High-Quality Neural Machine Translation in C++ ACL 2018

FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network NIPS 2018

GroupReduce: Block-Wise Low-Rank Approximation for Neural Language Model Shrinking NIPS 2018

Scaling provable adversarial defenses NIPS 2018

PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning CVPR 2018

Shift: A Zero FLOP, Zero Parameter Alternative to Spatial Convolutions CVPR 2018

SecureNets: Secure Inference of Deep Neural Networks on an Untrusted Cloud ACML 2018

Towards Binary-Valued Gates for Robust LSTM Training ICML 2018

Efficient DNN Neuron Pruning by Minimizing Layer-wise Nonlinear Reconstruction Error IJCAI 2018

Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations JMLR 2018