Backstitch: Counteracting Finite-Sample Bias via Negative Steps

Yiming Wang; Vijayaditya Peddinti; Hainan Xu; XiaoHui Zhang; Daniel Povey; Sanjeev Khudanpur

2017 INTERSPEECH INTERSPEECH 2017

Backstitch: Counteracting Finite-Sample Bias via Negative Steps

Abstract

In this paper we describe a modification to Stochastic Gradient Descent (SGD) that improves generalization to unseen data. It consists of doing two steps for each minibatch: a backward step with a small negative learning rate, followed by a forward step with a larger learning rate. The idea was initially inspired by ideas from adversarial training, but we show that it can be viewed as a crude way of canceling out certain systematic biases that come from training on finite data sets. The method gives ~ 10% relative improvement over our best acoustic models based on lattice-free MMI, across multiple datasets with 100–300 hours of data.

🧭 Keyword Pioneer — finite sample bia

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yiming Wang , Vijayaditya Peddinti , Hainan Xu , XiaoHui Zhang , Daniel Povey , Sanjeev Khudanpur

Topics

Machine Learning > Learning Types > Adversarial Learning Machine Learning > Optimization & Theory > Neural Network Optimization

Keywords

stochastic gradient descent adversarial training neural network optimization acoustic model finite sample bia

Download PDF

Related papers

Description of the Munich-Passau Snore Sound Corpus (MPSSC) 2017

A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification 2017

Binaural Reverberant Speech Separation Based on Deep Neural Networks 2017

Building Audio-Visual Phonetically Annotated Arabic Corpus for Expressive Text to Speech 2017

A Comparison of Danish Listeners’ Processing Cost in Judging the Truth Value of Norwegian, Swedish, and English Sentences 2017