A Quasi-Newton Approach to Nonsmooth Convex Optimization Problems in Machine Learning

Jin Yu; S.V.N. Vishwanathan; Simon Günter; Nicol N. Schraudolph

JMLR 2010

Abstract

We extend the well-known BFGS quasi-Newton method and its memory-limited variant LBFGS to the optimization of nonsmooth convex objectives. This is done in a rigorous fashion by generalizing three components of BFGS to subdifferentials: the local quadratic model, the identification of a descent direction, and the Wolfe line search conditions. We prove that under some technical conditions, the resulting subBFGS algorithm is globally convergent in objective function value. We apply its memory-limited variant (subLBFGS) to L2-regularized risk minimization with the binary hinge loss. To extend our algorithm to the multiclass and multilabel settings, we develop a new, efficient, exact line search algorithm. We prove its worst-case time complexity bounds, and show that our line search can also be used to extend a recently developed bundle method to the multiclass and multilabel settings. We also apply the direction-finding component of our algorithm to L1-regularized risk minimization with logistic loss. In all these contexts our methods perform comparably to or better than specialized state-of-the-art solvers on a number of publicly available data sets. An open source implementation of our algorithms is freely available.
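To make the nonsmooth setting concrete, the following is a minimal sketch (not the authors' implementation) of an L2-regularized binary hinge-loss objective and one element of its subdifferential. The function names, the averaging over examples, and the tie-breaking choice at the hinge point are illustrative assumptions, not taken from the paper.

```python
def hinge_objective(w, X, y, lam):
    """J(w) = (lam / 2) * ||w||^2 + average of max(0, 1 - y_i * <w, x_i>)."""
    reg = 0.5 * lam * sum(wi * wi for wi in w)
    losses = [
        max(0.0, 1.0 - yi * sum(wi * xi for wi, xi in zip(w, x)))
        for x, yi in zip(X, y)
    ]
    return reg + sum(losses) / len(X)


def hinge_subgradient(w, X, y, lam):
    """Return one subgradient of J at w.

    Assumption: where an example sits exactly at the hinge (margin == 1),
    we pick the zero contribution; any value between the two one-sided
    gradients would also give a valid subgradient.
    """
    n = len(X)
    g = [lam * wi for wi in w]
    for x, yi in zip(X, y):
        margin = yi * sum(wi * xi for wi, xi in zip(w, x))
        if margin < 1.0:  # example violates the margin, loss is active
            for j, xj in enumerate(x):
                g[j] -= yi * xj / n
    return g


# Tiny hypothetical data set: two points, one per class.
X = [[1.0, 0.0], [0.0, 1.0]]
y = [1, -1]
w0 = [0.0, 0.0]
J0 = hinge_objective(w0, X, y, lam=1.0)
g0 = hinge_subgradient(w0, X, y, lam=1.0)
```

Because the hinge loss has a kink wherever a margin equals 1, the gradient is not defined everywhere, so a quasi-Newton method in this setting has to work with sets of subgradients rather than a single gradient.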

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Optimization & Theory > Optimization
Mathematics & Optimization > Optimization > Continuous Optimization

Keywords

convex optimization, nonsmooth optimization, quasi-Newton method, hinge loss

Related papers

A Fast Hybrid Algorithm for Large-Scale ℓ1-Regularized Logistic Regression 2010

Model-based Boosting 2.0 2010

On Learning with Integral Operators 2010

Generalized Expectation Criteria for Semi-Supervised Learning with Weakly Labeled Data 2010

Hilbert Space Embeddings and Metrics on Probability Measures 2010