Mirror Descent Meets Fixed Share (and feels no regret)

Nicolò Cesa-bianchi; Pierre Gaillard; Gabor Lugosi; Gilles Stoltz

2012 NIPS NeurIPS 2012

Mirror Descent Meets Fixed Share (and feels no regret)

Abstract

Mirror descent with an entropic regularizer is known to achieve shifting regret bounds that are logarithmic in the dimension. This is done using either a carefully designed projection or by a weight sharing technique. Via a novel unified analysis, we show that these two approaches deliver essentially equivalent bounds on a notion of regret generalizing shifting, adaptive, discounted, and other related regrets. Our analysis also captures and extends the generalized weight sharing technique of Bousquet and Warmuth, and can be refined in several ways, including improvements for small losses and adaptive tuning of parameters.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — entropic regularizer

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

📈 Trend Setter — Game Theory

🐣 Hot Topic Early Bird — online algorithm

Authors

Nicolò Cesa-bianchi , Pierre Gaillard , Gabor Lugosi , Gilles Stoltz

Topics

Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Online Algorithms Machine Learning > Learning Types > Online Learning Mathematics & Optimization > Optimization > Optimization Artificial Intelligence > Core AI > Game Theory Mathematics & Optimization > Optimization > Convex Optimization

Keywords

online learning convex optimization mirror descent weight sharing entropic regularizer shifting regret regret bound online algorithm

Download PDF

Related papers

Kernel Hyperalignment 2012

Fused sparsity and robust estimation for linear models with unknown variance 2012

Slice sampling normalized kernel-weighted completely random measure mixture models 2012

Scaling MPE Inference for Constrained Continuous Markov Random Fields with Consensus Optimization 2012

Matrix reconstruction with the local max norm 2012