Nearly tight sample complexity bounds for learning mixtures of Gaussians via sample compression schemes

Hassan Ashtiani; Shai Ben-David; Nicholas Harvey; Christopher Liaw; Abbas Mehrabian; Yaniv Plan

2018 NIPS NeurIPS 2018

Nearly tight sample complexity bounds for learning mixtures of Gaussians via sample compression schemes

Abstract

We prove that ϴ(k d^2 / ε^2) samples are necessary and sufficient for learning a mixture of k Gaussians in R^d, up to error ε in total variation distance. This improves both the known upper bounds and lower bounds for this problem. For mixtures of axis-aligned Gaussians, we show that O(k d / ε^2) samples suffice, matching a known lower bound. The upper bound is based on a novel technique for distribution learning based on a notion of sample compression. Any class of distributions that allows such a sample compression scheme can also be learned with few samples. Moreover, if a class of distributions has such a compression scheme, then so do the classes of products and mixtures of those distributions. The core of our main result is showing that the class of Gaussians in R^d has an efficient sample compression.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

📈 Trend Setter — Theory

🐣 Hot Topic Early Bird — gaussian distribution

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Hassan Ashtiani , Shai Ben-David , Nicholas Harvey , Christopher Liaw , Abbas Mehrabian , Yaniv Plan

Topics

Machine Learning > Core Methods > Clustering Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Optimization & Theory > Statistical Learning Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling Mathematics & Optimization > Optimization > Theory Machine Learning > Optimization & Theory > Sample Complexity

Keywords

learning theory sample complexity sample compression distribution learning gaussian distribution gaussian mixture mixture of gaussian

Download PDF

Related papers

Maximum Causal Tsallis Entropy Imitation Learning 2018

Recurrent World Models Facilitate Policy Evolution 2018

Bandit Learning in Concave N-Person Games 2018

Algorithmic Assurance: An Active Approach to Algorithmic Testing using Bayesian Optimisation 2018

PAC-Bayes bounds for stable algorithms with instance-dependent priors 2018