Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Kevin Scaman; Francis Bach; Sébastien Bubeck; Laurent Massoulié; Yin Tat Lee

NeurIPS 2018

Abstract

In this work, we consider the distributed optimization of non-smooth convex functions using a network of computing units. We investigate this problem under two regularity assumptions: (1) the Lipschitz continuity of the global objective function, and (2) the Lipschitz continuity of local individual functions. Under the local regularity assumption, we provide the first optimal first-order decentralized algorithm called multi-step primal-dual (MSPD) and its corresponding optimal convergence rate. A notable aspect of this result is that, for non-smooth functions, while the dominant term of the error is in $O(1/\sqrt{t})$, the structure of the communication network only impacts a second-order term in $O(1/t)$, where $t$ is time. In other words, the error due to limits in communication resources decreases at a fast rate even in the case of non-strongly-convex objective functions. Under the global regularity assumption, we provide a simple yet efficient algorithm called distributed randomized smoothing (DRS) based on a local smoothing of the objective function, and show that DRS is within a $d^{1/4}$ multiplicative factor of the optimal convergence rate, where $d$ is the underlying dimension.
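To make the idea of local smoothing concrete, the sketch below illustrates generic Gaussian randomized smoothing, in which a non-smooth convex function $f$ is replaced by $f_\gamma(x) = \mathbb{E}[f(x + \gamma Z)]$ with $Z \sim \mathcal{N}(0, I)$, and the gradient of $f_\gamma$ is estimated by averaging subgradients of $f$ at randomly perturbed points. This is not the DRS algorithm itself, which the abstract does not specify. The function names, the test objective $f(x) = \|x\|_1$, the smoothing parameter `gamma`, and the sample count are all assumptions chosen for illustration.

```python
import numpy as np


def smoothed_gradient(subgrad, x, gamma, num_samples, rng):
    """Monte Carlo estimate of the gradient of f_gamma(x) = E[f(x + gamma * Z)].

    `subgrad` returns a subgradient of f at a point; Z is standard Gaussian.
    All names here are hypothetical and used for illustration only.
    """
    noise = rng.standard_normal((num_samples, x.shape[0]))
    points = x + gamma * noise  # randomly perturbed query points
    grads = np.array([subgrad(p) for p in points])
    return grads.mean(axis=0)  # average of subgradients


def l1_subgrad(x):
    """A subgradient of the non-smooth function f(x) = ||x||_1."""
    return np.sign(x)


rng = np.random.default_rng(0)
x = np.array([0.0, 2.0, -3.0])
g = smoothed_gradient(l1_subgrad, x, gamma=0.1, num_samples=2000, rng=rng)
print(g)
```

At the kink (first coordinate, where $x_1 = 0$) the averaged subgradient is close to zero, while far from the kink it matches the ordinary gradient of $\|x\|_1$. In this sense the smoothed function behaves like a differentiable surrogate near points of non-differentiability.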

Topics

Machine Learning > Optimization & Theory > Distributed Learning
Mathematics & Optimization > Optimization > Continuous Optimization
Computer Science > Systems > Distributed Systems
Mathematics & Optimization > Optimization > Convex Optimization
Machine Learning > Learning Types > Distributed Learning
Mathematics & Optimization > Optimization > Distributed Optimization

Keywords

distributed optimization, primal-dual algorithm, network optimization, Lipschitz continuity, convergence rate, primal-dual method, randomized smoothing, decentralized algorithm, non-smooth convex, non-smooth convex function
