Altitude Training: Strong Bounds for Single-Layer Dropout

Stefan Wager; William Fithian; Sida Wang; Percy Liang

2014 NIPS NeurIPS 2014

Altitude Training: Strong Bounds for Single-Layer Dropout

Abstract

Dropout training, originally designed for deep neural networks, has been successful on high-dimensional single-layer natural language tasks. This paper proposes a theoretical explanation for this phenomenon: we show that, under a generative Poisson topic model with long documents, dropout training improves the exponent in the generalization bound for empirical risk minimization. Dropout achieves this gain much like a marathon runner who practices at altitude: once a classifier learns to perform reasonably well on training examples that have been artificially corrupted by dropout, it will do very well on the uncorrupted test set. We also show that, under similar conditions, dropout preserves the Bayes decision boundary and should therefore induce minimal bias in high dimensions.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Data Augmentation

🧭 Keyword Pioneer — poisson topic model

🐣 Hot Topic Early Bird — empirical risk minimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Stefan Wager , William Fithian , Sida Wang , Percy Liang

Topics

Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Application Areas > Data Augmentation Natural Language Processing > Resources & Methods > Transfer Learning Deep Learning > Optimization & Theory > Theory Machine Learning > Learning Types > Regularization

Keywords

empirical risk minimization positive semidefinite matrix matrix completion dropout regularization generalization bound dropout training poisson topic model

Download PDF

Related papers

Information-based learning by agents in unbounded state spaces 2014

Stochastic Gradient Descent, Weighted Sampling, and the Randomized Kaczmarz algorithm 2014

Partition-wise Linear Models 2014

Active Regression by Stratification 2014

Cone-Constrained Principal Component Analysis 2014