No Internal Regret via Neighborhood Watch

Dean Foster; Alexander Rakhlin

2012 AISTATS AISTATS 2012

No Internal Regret via Neighborhood Watch

Abstract

We present an algorithm which attains O(\sqrtT) internal (and thus external) regret for finite games with partial monitoring under the local observability condition. Recently, this condition has been shown by Bartok, Pal, and Szepesvari (2011) to imply the O(\sqrtT) rate for partial monitoring games against an i.i.d. opponent, and the authors conjectured that the same holds for non-stochastic adversaries. Our result is in the affirmative, and it completes the characterization of possible rates for finite partial-monitoring games, an open question stated by Cesa-Bianchi, Lugosi, and Stoltz (2006). Our regret guarantees also hold for the more general model of partial monitoring with random signals.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐣 Hot Topic Early Bird — game theory

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Dean Foster , Alexander Rakhlin

Topics

Machine Learning > Optimization & Theory > Learning Theory Mathematics & Optimization > Optimization > Online Algorithms

Keywords

online learning game theory partial monitoring internal regret external regret

Download PDF

Related papers

Minimax rates for homology inference 2012

Scalable Personalization of Long-Term Physiological Monitoring: Active Learning Methodologies for Epileptic Seizure Onset Detection 2012

Adaptive Metropolis with Online Relabeling 2012

Joint Learning of Words and Meaning Representations for Open-Text Semantic Parsing 2012

Bayesian regularization of non-homogeneous dynamic Bayesian networks by globally coupling interaction parameters 2012