Training Deep Models Faster with Robust, Approximate Importance Sampling

Tyler B Johnson; Carlos Guestrin

NeurIPS 2018

Abstract

In theory, importance sampling speeds up stochastic gradient algorithms for supervised learning by prioritizing training examples. In practice, the cost of computing importances greatly limits the impact of importance sampling. We propose a robust, approximate importance sampling procedure (RAIS) for stochastic gradient descent. By approximating the ideal sampling distribution using robust optimization, RAIS provides much of the benefit of exact importance sampling with drastically reduced overhead. Empirically, we find RAIS-SGD and standard SGD follow similar learning curves, but RAIS moves faster through these paths, achieving speed-ups of at least 20% and sometimes much more.
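The idea of prioritizing training examples can be illustrated with a minimal sketch of exact importance sampling for a stochastic gradient estimate. This is not the RAIS procedure, whose details are not given in the abstract. It only shows the textbook construction that RAIS approximates: sample example i with probability p_i and reweight its gradient by 1 / (n * p_i) so the estimate stays unbiased. The function names and the toy scalar gradients are assumptions made up for illustration.

```python
import random


def norm_proportional_probs(grad_norms):
    """Sampling distribution proportional to per-example gradient norms."""
    total = sum(grad_norms)
    return [g / total for g in grad_norms]


def sample_weighted_gradient(grads, probs, rng):
    """Draw one example i ~ probs and return the reweighted gradient g_i / (n * p_i)."""
    n = len(grads)
    i = rng.choices(range(n), weights=probs, k=1)[0]
    return grads[i] / (n * probs[i])


def estimator_variance(grads, probs):
    """Exact variance of the one-sample estimator under `probs` (scalar gradients)."""
    n = len(grads)
    mean = sum(grads) / n
    second_moment = sum(p * (g / (n * p)) ** 2 for g, p in zip(grads, probs))
    return second_moment - mean ** 2


# Toy scalar per-example gradients (hypothetical numbers, one large outlier).
grads = [0.1, 0.2, 0.1, 4.0]
uniform = [1.0 / len(grads)] * len(grads)
weighted = norm_proportional_probs([abs(g) for g in grads])

var_uniform = estimator_variance(grads, uniform)
var_weighted = estimator_variance(grads, weighted)
one_draw = sample_weighted_gradient(grads, weighted, random.Random(0))
```

In this toy case, where every gradient is a positive scalar, sampling proportionally to the gradient magnitude makes each reweighted draw equal to the mean gradient, so the variance falls to zero (up to rounding) while uniform sampling keeps a large variance. Real gradients are vectors and the reduction is smaller. Computing every per-example norm at each step is also the overhead that the abstract identifies as the practical obstacle.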

Topics

Machine Learning > Optimization & Theory > Optimization

Machine Learning > Optimization & Theory > Stochastic Methods

Deep Learning > Optimization & Theory > Optimization

Deep Learning > Optimization & Theory > Stochastic Methods

Machine Learning > Learning Types > Optimization

Keywords

stochastic gradient descent, robust optimization, importance sampling, training acceleration, approximate method, gradient algorithm

Related papers

Maximum Causal Tsallis Entropy Imitation Learning 2018

Recurrent World Models Facilitate Policy Evolution 2018

Bandit Learning in Concave N-Person Games 2018

Algorithmic Assurance: An Active Approach to Algorithmic Testing using Bayesian Optimisation 2018

PAC-Bayes bounds for stable algorithms with instance-dependent priors 2018