Papers
Knowledge Distillation for Sequence Model
INTERSPEECH 2018
Sobolev Training for Neural Networks
NIPS 2017
Student-Teacher Training with Diverse Decision Tree Ensembles
INTERSPEECH 2017
Efficient Knowledge Distillation from an Ensemble of Teachers
INTERSPEECH 2017
Do Deep Nets Really Need to be Deep?
NIPS 2014