Better Algorithms for Stochastic Bandits with Adversarial Corruptions

Anupam Gupta; Tomer Koren; Kunal Talwar

COLT 2019

Abstract

We study the stochastic multi-armed bandits problem in the presence of adversarial corruption. We present a new algorithm for this problem whose regret is nearly optimal, substantially improving upon previous work. Our algorithm is agnostic to the level of adversarial contamination and can tolerate a significant amount of corruption with virtually no degradation in performance.
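To make the setting concrete, the following is a minimal sketch of a stochastic bandit simulation with a corruption budget. It is not the algorithm from the paper: the learner is a standard UCB1 baseline, and the adversary (which zeroes out successful pulls of the best arm until its budget runs out) is a hypothetical strategy chosen only for illustration. The function name `simulate`, the Bernoulli reward model, and all parameters are assumptions.

```python
import math
import random


def simulate(means, horizon, budget, seed=0):
    """Run a UCB1 baseline on Bernoulli arms under a budgeted adversary.

    Assumptions (illustrative only, not taken from the paper):
    - rewards are Bernoulli with the given means;
    - the adversary flips a reward of 1 on the best arm to 0,
      paying one unit of corruption each time, until `budget` is spent.
    Returns (pseudo_regret, corruption_used, pulls_per_arm).
    """
    rng = random.Random(seed)
    k = len(means)
    best = max(range(k), key=lambda i: means[i])
    pulls = [0] * k
    totals = [0.0] * k
    used = 0.0
    regret = 0.0

    for t in range(1, horizon + 1):
        if t <= k:
            # Pull every arm once so the UCB indices are defined.
            arm = t - 1
        else:
            # UCB1 index: empirical mean plus a confidence bonus.
            arm = max(
                range(k),
                key=lambda i: totals[i] / pulls[i]
                + math.sqrt(2.0 * math.log(t) / pulls[i]),
            )

        # Draw the true stochastic reward.
        reward = 1.0 if rng.random() < means[arm] else 0.0

        # The adversary corrupts the observed reward while budget remains.
        if arm == best and reward == 1.0 and used + 1.0 <= budget:
            reward = 0.0
            used += 1.0

        pulls[arm] += 1
        totals[arm] += reward

        # Pseudo-regret is measured against the true (uncorrupted) means.
        regret += means[best] - means[arm]

    return regret, used, pulls
```

Calling `simulate([0.5, 0.6], 2000, 50)` and `simulate([0.5, 0.6], 2000, 0)` with the same seed gives one way to compare how a corruption budget affects the baseline's regret. An algorithm that is agnostic to the corruption level, as the abstract describes, would not take `budget` as an input.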

Topics

Machine Learning > Learning Types > Adversarial Learning

Mathematics & Optimization > Optimization > Online Algorithms

Keywords

multi-armed bandit, regret bound, online algorithm, adversarial corruption

Related papers

Inference under Information Constraints: Lower Bounds from Chi-Square Contraction 2019

Learning in Non-convex Games with an Optimization Oracle 2019

Learning to Prune: Speeding up Repeated Computations 2019

A Universal Algorithm for Variational Inequalities Adaptive to Smoothness and Noise 2019

Learning Two Layer Rectified Neural Networks in Polynomial Time 2019