(Nearly) Optimal Algorithms for Private Online Learning in Full-information and Bandit Settings

Abhradeep Guha Thakurta; Adam Smith

2013 NIPS NeurIPS 2013

(Nearly) Optimal Algorithms for Private Online Learning in Full-information and Bandit Settings

Abstract

We provide a general technique for making online learning algorithms differentially private, in both the full information and bandit settings. Our technique applies to algorithms that aim to minimize a \emph{convex} loss function which is a sum of smaller convex loss terms, one for each data point. We modify the popular \emph{mirror descent} approach, or rather a variant called \emph{follow the approximate leader}. The technique leads to the first nonprivate algorithms for private online learning in the bandit setting. In the full information setting, our algorithms improve over the regret bounds of previous work. In many cases, our algorithms (in both settings) matching the dependence on the input length, $T$, of the \emph{optimal nonprivate} regret bounds up to logarithmic factors in $T$. Our algorithms require logarithmic space and update time.

🧭 Keyword Pioneer — follow the approximate leader

🐣 Hot Topic Early Bird — differential privacy

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

🌉 Interdisciplinary Bridge — Machine Learning and Security & Privacy

📈 Trend Setter — Differential Privacy

Authors

Abhradeep Guha Thakurta , Adam Smith

Topics

Machine Learning > Optimization & Theory > Optimization Machine Learning > Application Areas > Privacy Machine Learning > Learning Types > Online Learning Security & Privacy > Differential Privacy Machine Learning > Learning Types > Multi-Armed Bandits

Keywords

differential privacy online learning convex optimization convex loss mirror descent bandit setting follow the approximate leader regret bound

Download PDF

Related papers

Latent Structured Active Learning 2013

On Flat versus Hierarchical Classification in Large-Scale Taxonomies 2013

Generalized Method-of-Moments for Rank Aggregation 2013

Third-Order Edge Statistics: Contour Continuation, Curvature, and Cortical Connections 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent 2013