Online Non-Convex Learning: Following the Perturbed Leader is Optimal

Arun Sai Suggala; Praneeth Netrapalli

2020 ALT ALT 2020

Online Non-Convex Learning: Following the Perturbed Leader is Optimal

Abstract

We study the problem of online learning with non-convex losses, where the learner has access to an offline optimization oracle. We show that the classical Follow the Perturbed Leader (FTPL) algorithm achieves optimal regret rate of $O(T^{-1/2})$ in this setting. This improves upon the previous best-known regret rate of $O(T^{-1/3})$ for FTPL. We further show that an optimistic variant of FTPL achieves better regret bounds when the sequence of losses encountered by the learner is “predictable”.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Arun Sai Suggala , Praneeth Netrapalli

Topics

Mathematics & Optimization > Optimization > Online Algorithms

Keywords

online learning non-convex optimization follow the perturbed leader regret bound

Download PDF

Related papers

On Learnability wih Computable Learners 2020

Interactive Learning of a Dynamic Structure 2020

A Non-Trivial Algorithm Enumerating Relevant Features over Finite Fields 2020

Finding Robust Nash equilibria 2020

Approximate Representer Theorems in Non-reflexive Banach Spaces 2020