A Second-Order Method for Stochastic Bandit Convex Optimisation

Tor Lattimore; András György

2023 COLT COLT 2023

A Second-Order Method for Stochastic Bandit Convex Optimisation

Abstract

We introduce a simple and efficient algorithm for unconstrained zeroth-order stochastic convex bandits and prove its regret is at most (1 + r/d)[d^1.5 sqrt(n) + d^3] polylog(n, d, r) where n is the horizon, d the dimension and r is the radius of a known ball containing the minimiser of the loss.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy

Authors

Tor Lattimore , András György

Topics

Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Stochastic Methods

Keywords

stochastic convex optimization regret bound bandit convex optimization zeroth-order optimization second-order method

Download PDF

Related papers

Towards a Complete Analysis of Langevin Monte Carlo: Beyond Poincaré Inequality 2023

Improved Discretization Analysis for Underdamped Langevin Monte Carlo 2023

Convergence of AdaGrad for Non-convex Objectives: Simple Proofs and Relaxed Assumptions 2023

Stability and Generalization of Stochastic Optimization with Nonconvex and Nonsmooth Problems 2023

Online Learning and Solving Infinite Games with an ERM Oracle 2023