Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence

Maryam Aziz; Jesse Anderton; Emilie Kaufmann; Javed Aslam

2018 ALT ALT 2018

Pure Exploration in Infinitely-Armed Bandit Models with Fixed-Confidence

Abstract

We consider the problem of near-optimal arm identification in the fixed confidence setting of the infinitely armed bandit problem when nothing is known about the arm reservoir distribution. We (1) introduce a PAC-like framework within which to derive and cast results; (2) derive a sample complexity lower bound for near-optimal arm identification; (3) propose an algorithm that identifies a nearly-optimal arm with high probability and derive an upper bound on its sample complexity which is within a log factor of our lower bound; and (4) discuss whether our $\log^2 \frac{1}{δ}$ dependence is inescapable for “two-phase” (select arms first, identify the best later) algorithms in the infinite setting. This work permits the application of bandit models to a broader class of problems where fewer assumptions hold.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Maryam Aziz , Jesse Anderton , Emilie Kaufmann , Javed Aslam

Topics

Mathematics & Optimization > Optimization > Online Algorithms

Keywords

sample complexity multi-armed bandit pure exploration arm identification fixed confidence

Download PDF

Related papers

Dimension-free Information Concentration via Exp-Concavity 2018

Multi-task {K}ernel {L}earning Based on {P}robabilistic {L}ipschitzness 2018

An Adaptive Strategy for Active Learning with Smooth Decision Boundary 2018

Corrupt Bandits for Preserving Local Privacy 2018

Online Learning of Combinatorial Objects via Extended Formulation 2018