Convergence Guarantees for a Class of Non-convex and Non-smooth Optimization Problems

Koulik Khamaru; Martin J. Wainwright

2019 JMLR JMLR 2019

Convergence Guarantees for a Class of Non-convex and Non-smooth Optimization Problems

Abstract

We consider the problem of finding critical points of functions that are non-convex and non-smooth. Studying a fairly broad class of such problems, we analyze the behavior of three gradient-based methods (gradient descent, proximal update, and Frank-Wolfe update). For each of these methods, we establish rates of convergence for general problems, and also prove faster rates for continuous sub-analytic functions. We also show that our algorithms can escape strict saddle points for a class of non-smooth functions, thereby generalizing known results for smooth functions. Our analysis leads to a simplification of the popular CCCP algorithm, used for optimizing functions that can be written as a difference of two convex functions. Our simplified algorithm retains all the convergence properties of CCCP, along with a significantly lower cost per iteration. We illustrate our methods and theory via applications to the problems of best subset selection, robust estimation, mixture density estimation, and shape-from-shading reconstruction. [abs] [ pdf ][ bib ] © JMLR 2019. (edit, beta)

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — cccp algorithm

🐣 Hot Topic Early Bird — non-convex optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy

Authors

Koulik Khamaru , Martin J. Wainwright

Topics

Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Continuous Optimization

Keywords

non-convex optimization continuous optimization gradient descent convergence guarantee convergence rate non-smooth optimization cccp algorithm

Download PDF

Related papers

Adaptation Based on Generalized Discrepancy 2019

Iterated Learning in Dynamic Social Networks 2019

Pyro: Deep Universal Probabilistic Programming 2019

Matched Bipartite Block Model with Covariates 2019

Approximation Hardness for A Class of Sparse Optimization Problems 2019