Minimax Learning of Ergodic Markov Chains

Geoﬀrey Wolfer; Aryeh Kontorovich

2019 ALT ALT 2019

Minimax Learning of Ergodic Markov Chains

Abstract

We compute the finite-sample minimax (modulo logarithmic factors) sample complexity of learning the parameters of a finite Markov chain from a single long sequence of states. Our error metric is a natural variant of total variation. The sample complexity necessarily depends on the spectral gap and minimal stationary probability of the unknown chain, for which there are known finite-sample estimators with fully empirical confidence intervals. To our knowledge, this is the first PAC-type result with nearly matching (up to logarithmic factors) upper and lower bounds for learning, in any metric, in the context of Markov chains.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Geoﬀrey Wolfer , Aryeh Kontorovich

Topics

Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Optimization & Theory > Stochastic Processes Mathematics & Optimization > Mathematics > Probability

Keywords

sample complexity total variation finite-sample bound minimax learning ergodic markov chain

Download PDF

Related papers

An Exponential Efron-Stein Inequality for $L_q$ Stable Learning Rules 2019

Online Influence Maximization with Local Observations 2019

Stochastic Nonconvex Optimization with Large Minibatches 2019

Average-Case Information Complexity of Learning 2019

Algorithmic Learning Theory 2019: Preface 2019