A Study on Trust Region Update Rules in Newton Methods for Large-scale Linear Classification

Chih-Yang Hsia; Ya Zhu; Chih-Jen Lin

ACML 2017

Abstract

The main task in training a linear classifier is to solve an unconstrained minimization problem. To apply an optimization method, we typically find a good direction and then decide a suitable step size, repeating both at each iteration. Past work on extending optimization methods to large-scale linear classification has focused on finding the direction, while little attention has been paid to adjusting the step size. In this work, we explain that inappropriate step-size adjustment may lead to seriously slow convergence. Of the two major approaches to step-size selection, line search and trust region, we focus on trust region methods. After presenting a detailed analysis, we develop novel and effective techniques to adjust the trust-region size. Experiments indicate that our new settings significantly outperform existing implementations for large-scale linear classification.
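To make the role of the trust-region size concrete, the following is a minimal sketch of a generic trust-region Newton iteration on a one-dimensional L2-regularized logistic loss. It is not the update rule proposed in the paper. The thresholds (1e-4, 0.25, 0.75), the shrink and grow factors (0.25, 2.0), the toy data, and all function names are assumptions taken from the usual textbook scheme and chosen only for illustration.

```python
import math

def loss(w, data, C=1.0):
    # f(w) = 0.5*w^2 + C * sum_i log(1 + exp(-y_i * x_i * w))
    return 0.5 * w * w + C * sum(math.log1p(math.exp(-y * x * w)) for x, y in data)

def grad_hess(w, data, C=1.0):
    # First and second derivatives of f; labels y are assumed to be +1 or -1.
    g, h = w, 1.0
    for x, y in data:
        sigma = 1.0 / (1.0 + math.exp(-y * x * w))
        g += C * (sigma - 1.0) * y * x
        h += C * sigma * (1.0 - sigma) * x * x
    return g, h

def trust_region_newton(data, w=0.0, delta=1.0, tol=1e-6, max_iter=100):
    for _ in range(max_iter):
        g, h = grad_hess(w, data)
        if abs(g) <= tol:
            break
        # In 1-D with h > 0, the subproblem min g*s + 0.5*h*s^2 s.t. |s| <= delta
        # is solved by clipping the Newton step to the trust region.
        s = max(-delta, min(delta, -g / h))
        predicted = -(g * s + 0.5 * h * s * s)       # reduction promised by the model
        actual = loss(w, data) - loss(w + s, data)   # reduction actually obtained
        rho = actual / predicted
        if rho > 1e-4:        # accept the step only if the model was reliable enough
            w += s
        if rho < 0.25:        # poor agreement: shrink the trust region
            delta *= 0.25
        elif rho > 0.75 and abs(s) >= delta:
            delta *= 2.0      # good agreement at the boundary: enlarge it
    return w

# Hypothetical toy data: (feature, label) pairs.
data = [(1.0, 1), (2.0, 1), (-1.0, -1), (0.5, -1)]
w_star = trust_region_newton(data)
```

The ratio `rho` of actual to predicted reduction is the quantity that drives the size adjustment. A rule that shrinks `delta` too aggressively or enlarges it too cautiously forces many short steps, which is the kind of slow convergence the abstract refers to.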

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Optimization & Theory > Optimization
Mathematics & Optimization > Optimization > Continuous Optimization

Keywords

large-scale optimization, linear classification, step size, Newton method, trust region method
