Online Learning: Beyond Regret

Alexander Rakhlin; Karthik Sridharan; Ambuj Tewari

2011 COLT COLT 2011

Online Learning: Beyond Regret

Abstract

We study online learnability of a wide class of problems, extending the results of Rakhlin et al. (2010a) to general notions of performance measure well beyond external regret. Our framework simultaneously captures such well-known notions as internal and general $\Phi$-regret, learning with non-additive global cost functions, Blackwell’s approachability, calibration of forecasters, and more. We show that learnability in all these situations is due to control of the same three quantities: a martingale convergence term, a term describing the ability to perform well if future is known, and a generalization of sequential Rademacher complexity, studied in Rakhlin et al. (2010a). Since we directly study complexity of the problem instead of focusing on efficient algorithms, we are able to improve and extend many known results which have been previously derived via an algorithmic construction.

🚀 Conference Pioneer — COLT 2011

🌱 Topic Pioneer — Learning Paradigms

🧭 Keyword Pioneer — sequential rademacher complexity

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Mathematics & Optimization

📈 Trend Setter — Game Theory

🐣 Hot Topic Early Bird — regret minimization

Authors

Alexander Rakhlin , Karthik Sridharan , Ambuj Tewari

Topics

Artificial Intelligence > Learning Paradigms Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Optimization & Theory > Theory Machine Learning > Learning Paradigms Machine Learning > Learning Types > Online Learning Mathematics & Optimization > Optimization > Game Theory

Keywords

online learning regret minimization blackwell approachability sequential rademacher complexity internal regret

Download PDF

Related papers

Competitive Closeness Testing 2011

Bandits, Query Learning, and the Haystack Dimension 2011

Minimax Policies for Combinatorial Prediction Games 2011

Sample Complexity Bounds for Differentially Private Learning 2011

Multiclass Learnability and the ERM principle 2011