Optimal Algorithms for Smooth and Strongly Convex Distributed Optimization in Networks

Kevin Scaman; Francis Bach; Sébastien Bubeck; Yin Tat Lee; Laurent Massoulié

2017 ICML ICML 2017

Optimal Algorithms for Smooth and Strongly Convex Distributed Optimization in Networks

Abstract

In this paper, we determine the optimal convergence rates for strongly convex and smooth distributed optimization in two settings: centralized and decentralized communications over a network. For centralized (i.e. master/slave) algorithms, we show that distributing Nesterov’s accelerated gradient descent is optimal and achieves a precision $\varepsilon > 0$ in time $O(\sqrt{\kappa_g}(1+\Delta\tau)\ln(1/\varepsilon))$, where $\kappa_g$ is the condition number of the (global) function to optimize, $\Delta$ is the diameter of the network, and $\tau$ (resp. $1$) is the time needed to communicate values between two neighbors (resp. perform local computations). For decentralized algorithms based on gossip, we provide the first optimal algorithm, called the multi-step dual accelerated (MSDA) method, that achieves a precision $\varepsilon > 0$ in time $O(\sqrt{\kappa_l}(1+\frac{\tau}{\sqrt{\gamma}})\ln(1/\varepsilon))$, where $\kappa_l$ is the condition number of the local functions and $\gamma$ is the (normalized) eigengap of the gossip matrix used for communication between nodes. We then verify the efficiency of MSDA against state-of-the-art methods for two problems: least-squares regression and classification by logistic regression.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — nesterov accelerated gradient

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Data Science & Analytics, Machine Learning, Mathematics & Optimization, Reinforcement Learning

📈 Trend Setter — Distributed Learning

Authors

Kevin Scaman , Francis Bach , Sébastien Bubeck , Yin Tat Lee , Laurent Massoulié

Topics

Machine Learning > Optimization & Theory > Distributed Learning Mathematics & Optimization > Optimization > Continuous Optimization Mathematics & Optimization > Optimization > Stochastic Methods Mathematics & Optimization > Optimization > Distributed Learning Mathematics & Optimization > Optimization > Convex Optimization

Keywords

convex optimization distributed optimization least squares regression network optimization strongly convex optimization nesterov acceleration nesterov accelerated gradient gossip algorithm

Download PDF

Related papers

Bottleneck Conditional Density Estimation 2017

Constrained Policy Optimization 2017

Near-Optimal Design of Experiments via Regret Minimization 2017

Input Convex Neural Networks 2017

An Efficient, Sparsity-Preserving, Online Algorithm for Low-Rank Approximation 2017