Average-Case Information Complexity of Learning

Ido Nachum; Amir Yehudayoff

ALT 2019

Abstract

How many bits of information are revealed by a learning algorithm for a concept class of VC-dimension $d$? Previous works have shown that even for $d=1$ the amount of information may be unbounded (it tends to $\infty$ with the universe size). Can it be that all concepts in the class require leaking a large amount of information? We show that typically concepts do not require leakage: there exists a proper learning algorithm that reveals $O(d)$ bits of information for most concepts in the class. This result is a special case of a more general phenomenon we explore. If there is a low-information learner when the algorithm \emph{knows} the underlying distribution on inputs, then there is a learner that reveals little information on an average concept \emph{without knowing} the distribution on inputs.
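
The abstract does not spell out how "bits of information revealed" is measured. A minimal sketch, assuming the usual mutual-information convention, is given here; the symbols $S$, $A$, $\mathcal{D}$, $m$, $c$ and $\mathcal{C}$ are illustrative notation, not taken from the text above.

```latex
% Assumed setup (not stated in the abstract):
%   S = ((x_1, c(x_1)), ..., (x_m, c(x_m))), with x_i drawn i.i.d. from a distribution D,
%   c is a concept in a class C of VC-dimension d,
%   A is a (possibly randomized) learning algorithm that outputs a hypothesis A(S).
\[
  \text{information revealed by } A \text{ on } (c, \mathcal{D})
  \;=\; I\bigl(S;\, A(S)\bigr)
\]
% Average-case reading of the main claim: for most concepts c in C,
\[
  I\bigl(S;\, A(S)\bigr) \;=\; O(d).
\]
```

Under this reading, the worst-case statement for $d=1$ says that some concept forces $I(S; A(S))$ to grow with the universe size, while the average-case statement says that such concepts are atypical.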

Topics

Machine Learning > Optimization & Theory > Learning Theory

Machine Learning > Optimization & Theory > Information Theory

Keywords

learning algorithm, proper learning, information complexity, average concept
