(ε, u)-Adaptive Regret Minimization in Heavy-Tailed Bandits

Gianmarco Genalti; Lupo Marsigli; Nicola Gatti; Alberto Maria Metelli

COLT 2024

Abstract

Heavy-tailed distributions arise naturally in several settings, from finance to telecommunications. While regret minimization under subgaussian or bounded rewards has been widely studied, learning with heavy-tailed distributions has gained popularity only over the last decade. In this paper, we consider the setting in which the reward distributions have finite absolute raw moments of maximum order $1+\epsilon$, uniformly bounded by a constant $u<+\infty$, for some $\epsilon \in (0,1]$. In this setting, we study the regret minimization problem when $\epsilon$ and $u$ are unknown to the learner, which must therefore adapt to them. First, we show that adaptation comes at a cost and derive two negative results proving that the regret guarantees of the non-adaptive case cannot be achieved without further assumptions. Then, we devise and analyze a fully data-driven trimmed mean estimator and propose a novel adaptive regret minimization algorithm, \texttt{AdaR-UCB}, that leverages this estimator. Finally, we show that \texttt{AdaR-UCB} is the first algorithm that, under a known distributional assumption, enjoys regret guarantees nearly matching those of the non-adaptive heavy-tailed case.
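For context on the non-adaptive case the abstract compares against, the following is a minimal sketch of a truncated (trimmed) mean estimator of the kind commonly used in the heavy-tailed bandit literature when $\epsilon$ and $u$ are known. It is not the data-driven estimator of this paper, which the abstract does not specify. The function name `truncated_mean`, its parameters, and the threshold $(u s / \log(1/\delta))^{1/(1+\epsilon)}$ for the $s$-th sample are assumptions made for illustration.

```python
import math


def truncated_mean(samples, epsilon, u, delta):
    """Truncated empirical mean, assuming epsilon and u are KNOWN.

    Illustrative sketch only (hypothetical helper, not the paper's estimator).
    The s-th sample (1-indexed) contributes to the sum only if its absolute
    value is at most (u * s / log(1/delta)) ** (1 / (1 + epsilon));
    otherwise it is replaced by 0.
    """
    if not samples:
        raise ValueError("need at least one sample")
    if not (0 < epsilon <= 1 and u > 0 and 0 < delta < 1):
        raise ValueError("require 0 < epsilon <= 1, u > 0, 0 < delta < 1")

    log_term = math.log(1.0 / delta)
    total = 0.0
    for s, x in enumerate(samples, start=1):
        # The threshold grows with s, so later samples are trimmed less aggressively.
        threshold = (u * s / log_term) ** (1.0 / (1.0 + epsilon))
        if abs(x) <= threshold:
            total += x
    # Divide by the full sample count, including trimmed samples.
    return total / len(samples)
```

Note that the threshold depends on both $\epsilon$ and $u$, which is why this estimator cannot be used as-is when those quantities are unknown to the learner.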

Authors

Gianmarco Genalti, Lupo Marsigli, Nicola Gatti, Alberto Maria Metelli

Topics

Machine Learning > Optimization & Theory > Learning Theory
Mathematics & Optimization > Optimization > Stochastic Methods

Keywords

stochastic optimization, regret minimization, adaptive algorithm, heavy-tailed distribution, bandit algorithm, trimmed mean estimator
