Online Optimization : Competing with Dynamic Comparators

Ali Jadbabaie; Alexander Rakhlin; Shahin Shahrampour; Karthik Sridharan

2015 AISTATS AISTATS 2015

Online Optimization : Competing with Dynamic Comparators

Abstract

Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop prediction methods that perform well against complex benchmarks. In this paper, we address these two directions together. We present a fully adaptive method that competes with dynamic benchmarks in which regret guarantee scales with regularity of the sequence of cost functions and comparators. Notably, the regret bound adapts to the smaller complexity measure in the problem environment. Finally, we apply our results to drifting zero-sum, two-player games where both players achieve no regret guarantees against best sequences of actions in hindsight.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — dynamic comparator

🐣 Hot Topic Early Bird — stochastic process

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ali Jadbabaie , Alexander Rakhlin , Shahin Shahrampour , Karthik Sridharan

Topics

Machine Learning > Optimization & Theory > Learning Theory Mathematics & Optimization > Optimization > Online Algorithms

Keywords

online optimization stochastic process regret bound two-player game dynamic comparator

Download PDF

Related papers

Near-optimal max-affine estimators for convex regression 2015

Sparse Solutions to Nonnegative Linear Systems and Applications 2015

Dimensionality estimation without distances 2015

The Security of Latent Dirichlet Allocation 2015

Robust sketching for multiple square-root LASSO problems 2015