Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization

Umut Simsekli; Cagatay Yildiz; Than Huy Nguyen; Taylan Cemgil; Gaël RICHARD

2018 ICML ICML 2018

Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization

Abstract

Recent studies have illustrated that stochastic gradient Markov Chain Monte Carlo techniques have a strong potential in non-convex optimization, where local and global convergence guarantees can be shown under certain conditions. By building up on this recent theory, in this study, we develop an asynchronous-parallel stochastic L-BFGS algorithm for non-convex optimization. The proposed algorithm is suitable for both distributed and shared-memory settings. We provide formal theoretical analysis and show that the proposed method achieves an ergodic convergence rate of ${\cal O}(1/\sqrt{N})$ ($N$ being the total number of iterations) and it can achieve a linear speedup under certain conditions. We perform several experiments on both synthetic and real datasets. The results support our theory and show that the proposed algorithm provides a significant speedup over the recently proposed synchronous distributed L-BFGS algorithm.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — stochastic quasi-newton method

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy

Authors

Umut Simsekli , Cagatay Yildiz , Than Huy Nguyen , Taylan Cemgil , Gaël RICHARD

Topics

Machine Learning > Optimization & Theory > Bayesian Inference Machine Learning > Optimization & Theory > Stochastic Processes Mathematics & Optimization > Optimization > Optimization Machine Learning > Core Methods > Optimization

Keywords

non-convex optimization markov chain monte carlo quasi-newton method asynchronous parallel stochastic gradient markov chain monte carlo stochastic quasi-newton method ergodic convergence

Download PDF

Related papers

Rectify Heterogeneous Models with Semantic Mapping 2018

Bayesian Optimization of Combinatorial Structures 2018

The Well-Tempered Lasso 2018

Approximation Algorithms for Cascading Prediction Models 2018

Classification from Pairwise Similarity and Unlabeled Data 2018