On Learning Distributions from their Samples

Sudeep Kamath; Alon Orlitsky; Dheeraj Pichapati; Ananda Theertha Suresh

2015 COLT COLT 2015

On Learning Distributions from their Samples

Abstract

One of the most natural and important questions in statistical learning is: how well can a distribution be approximated from its samples. Surprisingly, this question has so far been resolved for only one loss, the KL-divergence and even in this case, the estimator used is ad hoc and not well understood. We study distribution approximations for general loss measures. For \ell_2^2 we determine the best approximation possible, for \ell_1 and χ^2 we derive tight bounds on the best approximation, and when the probabilities are bounded away from zero, we resolve the question for all sufficiently smooth loss measures, thereby providing a coherent understanding of the rate at which distributions can be approximated from their samples.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐣 Hot Topic Early Bird — kl divergence

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sudeep Kamath , Alon Orlitsky , Dheeraj Pichapati , Ananda Theertha Suresh

Topics

Machine Learning > Optimization & Theory > Statistical Learning Mathematics & Optimization > Mathematics > Statistics

Keywords

kl divergence sample complexity statistical estimation distribution learning loss function

Download PDF

Related papers

Open Problem: Restricted Eigenvalue Condition for Heavy Tailed Designs 2015

Open Problem: The Oracle Complexity of Smooth Convex Optimization in Nonstandard Settings 2015

Online Learning with Feedback Graphs: Beyond Bandits 2015

Learning Overcomplete Latent Variable Models through Tensor Methods 2015

Efficient Learning of Linear Separators under Bounded Noise 2015