A Delay-tolerant Proximal-Gradient Algorithm for Distributed Learning

Konstantin Mishchenko; Franck Iutzeler; Jérôme Malick; Massih-Reza Amini

2018 ICML ICML 2018

A Delay-tolerant Proximal-Gradient Algorithm for Distributed Learning

Abstract

Distributed learning aims at computing high-quality models by training over scattered data. This covers a diversity of scenarios, including computer clusters or mobile agents. One of the main challenges is then to deal with heterogeneous machines and unreliable communications. In this setting, we propose and analyze a flexible asynchronous optimization algorithm for solving nonsmooth learning problems. Unlike most existing methods, our algorithm is adjustable to various levels of communication costs, machines computational powers, and data distribution evenness. We prove that the algorithm converges linearly with a fixed learning rate that does not depend on communication delays nor on the number of machines. Although long delays in communication may slow down performance, no delay can break convergence.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — communication delay

🐣 Hot Topic Early Bird — distributed learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning

📈 Trend Setter — Federated Learning

Authors

Konstantin Mishchenko , Franck Iutzeler , Jérôme Malick , Massih-Reza Amini

Topics

Machine Learning > Optimization & Theory > Distributed Learning Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Stochastic Methods Machine Learning > Learning Types > Federated Learning Mathematics & Optimization > Optimization > Distributed Optimization

Keywords

stochastic optimization distributed learning asynchronous optimization nonsmooth optimization proximal gradient method linear convergence proximal gradient communication delay

Download PDF

Related papers

Rectify Heterogeneous Models with Semantic Mapping 2018

Bayesian Optimization of Combinatorial Structures 2018

The Well-Tempered Lasso 2018

Approximation Algorithms for Cascading Prediction Models 2018

Classification from Pairwise Similarity and Unlabeled Data 2018