Boosting with the Logistic Loss is Consistent

Matus Telgarsky

COLT 2013

Abstract

This manuscript provides optimization guarantees, generalization bounds, and statistical consistency results for AdaBoost variants which replace the exponential loss with the logistic and similar losses (specifically, twice differentiable convex losses which are Lipschitz and tend to zero on one side). The heart of the analysis is to show that, in lieu of explicit regularization and constraints, the structure of the problem is fairly rigidly controlled by the source distribution itself. The first control of this type is in the separable case, where a distribution-dependent relaxed weak learning rate induces speedy convergence with high probability over any sample. Otherwise, in the nonseparable case, the convex surrogate risk itself exhibits distribution-dependent levels of curvature, and consequently the algorithm’s output has small norm with high probability.
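As a rough illustration of the kind of algorithm the abstract refers to, the sketch below runs boosting as greedy coordinate descent on the empirical logistic risk, with no explicit regularization or norm constraint. It is a minimal sketch under stated assumptions, not the paper's exact procedure: the loss convention ln(1 + exp(z)) applied to z = -y f(x), the fixed step size 4 (from the bound loss'' <= 1/4), and the names `boost`, `H`, and `rounds` are all illustrative choices that do not appear in the source.

```python
import numpy as np

def logistic_loss(z):
    # ln(1 + exp(z)), computed stably. Convex, 1-Lipschitz, twice
    # differentiable, and tends to zero as z -> -infinity.
    return np.logaddexp(0.0, z)

def logistic_loss_grad(z):
    # Derivative of ln(1 + exp(z)) is the sigmoid, written via tanh for stability.
    return 0.5 * (1.0 + np.tanh(0.5 * z))

def boost(H, y, rounds=200):
    """Greedy coordinate descent on the empirical logistic risk.

    H: (n, d) array, H[i, j] = output of weak learner j on example i, in [-1, +1].
    y: (n,) array of labels in {-1, +1}.
    Returns the weight vector and the empirical risk after each round.
    """
    n, d = H.shape
    A = -(y[:, None] * H)          # empirical risk is mean(logistic_loss(A @ w))
    w = np.zeros(d)
    risks = []
    for _ in range(rounds):
        q = logistic_loss_grad(A @ w)   # per-example weights (the "reweighting")
        g = A.T @ q / n                 # gradient of the empirical risk
        j = int(np.argmax(np.abs(g)))   # weak learner with the largest correlation
        # Assumed step rule: since loss'' <= 1/4 and |A[i, j]| <= 1, the risk is
        # (1/4)-smooth along each coordinate, so a step of 4 * g[j] cannot increase it.
        w[j] -= 4.0 * g[j]
        risks.append(float(logistic_loss(A @ w).mean()))
    return w, risks
```

Calling `boost(H, y)` with a matrix of weak-learner predictions returns unconstrained weights `w`. The abstract's claim concerns exactly this unregularized setting: on separable data the risk is driven down quickly, and on nonseparable data the returned weights stay small in norm with high probability.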

Topics

Machine Learning > Core Methods > Classification

Machine Learning > Optimization & Theory > Optimization

Machine Learning > Optimization & Theory > Statistical Learning

Keywords

statistical consistency, convex optimization, empirical risk minimization, logistic loss, generalization bound, surrogate risk

Related papers

A Tensor Spectral Approach to Learning Mixed Membership Community Models 2013

Adaptive Crowdsourcing Algorithms for the Bandit Survey Problem 2013

Online Learning with Predictable Sequences 2013

Recovering the Optimal Solution by Dual Random Projection 2013

Opportunistic Strategies for Generalized No-Regret Problems 2013