A Primal-Dual Convergence Analysis of Boosting

Matus Telgarsky

JMLR 2012

Abstract

Boosting combines weak learners into a predictor with low empirical risk. Its dual constructs a high entropy distribution upon which weak learners and training labels are uncorrelated. This manuscript studies this primal-dual relationship under a broad family of losses, including the exponential loss of AdaBoost and the logistic loss, revealing:

• Weak learnability aids the whole loss family: for any ε > 0, O(ln(1/ε)) iterations suffice to produce a predictor with empirical risk ε-close to the infimum;
• The circumstances granting the existence of an empirical risk minimizer may be characterized in terms of the primal and dual problems, yielding a new proof of the known rate O(ln(1/ε));
• Arbitrary instances may be decomposed into the above two, granting rate O(1/ε), with a matching lower bound provided for the logistic loss.

© JMLR 2012.
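The primal-dual picture in the abstract can be illustrated with a minimal sketch, which is not the paper's algorithm or notation. It assumes the exponential loss, weak learners taking values in {-1, +1}, and the classical AdaBoost closed-form step. The names `boost` and `A_EXAMPLE` and the 4-by-3 toy matrix are hypothetical, chosen so that the instance is weakly learnable. Each round forms the dual distribution over examples, picks the weak learner with the largest absolute edge (correlation with the labels) under that distribution, and takes a coordinate step on the empirical risk.

```python
import math

# Hypothetical toy instance: A[i][j] = y_i * h_j(x_i) in {-1, +1}.
# Equal positive weights on all three columns give every example a
# positive margin, so this instance is weakly learnable.
A_EXAMPLE = [
    [+1, +1, -1],
    [+1, -1, +1],
    [-1, +1, +1],
    [+1, +1, +1],
]

def boost(A, n_rounds):
    """Greedy coordinate descent on the empirical exponential risk.

    Returns the weak-learner weights and the empirical risk after each round.
    """
    m, n = len(A), len(A[0])
    lam = [0.0] * n
    risks = []
    for _ in range(n_rounds):
        # Dual variable: distribution over examples, proportional to exp(-margin).
        w = [math.exp(-sum(A[i][j] * lam[j] for j in range(n))) for i in range(m)]
        total = sum(w)
        d = [wi / total for wi in w]
        # Edge of each weak learner: its correlation with the labels under d.
        edges = [sum(d[i] * A[i][j] for i in range(m)) for j in range(n)]
        j = max(range(n), key=lambda k: abs(edges[k]))
        g = edges[j]
        # Stop if no learner has an edge, or one learner is perfect under d.
        if abs(g) < 1e-12 or abs(g) > 1.0 - 1e-12:
            break
        # AdaBoost step for +/-1-valued learners; shrinks the risk by sqrt(1 - g^2).
        lam[j] += 0.5 * math.log((1.0 + g) / (1.0 - g))
        margins = [sum(A[i][k] * lam[k] for k in range(n)) for i in range(m)]
        risks.append(sum(math.exp(-mi) for mi in margins) / m)
    return lam, risks

lam, risks = boost(A_EXAMPLE, 100)
```

On this toy instance every dual distribution leaves some weak learner with edge at least 1/3, so each round multiplies the risk by at most sqrt(1 - 1/9). That geometric decrease is the behavior the first bullet describes, where O(ln(1/ε)) iterations suffice under weak learnability.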

Authors

Matus Telgarsky

Topics

Machine Learning > Optimization & Theory > Learning Theory
Machine Learning > Optimization & Theory > Optimization

Keywords

primal-dual optimization, convergence analysis, empirical risk minimization, weak learnability

Related papers

Plug-in Approach to Active Learning 2012

An Active Learning Algorithm for Ranking from Pairwise Preferences with an Almost Optimal Query Complexity 2012

Eliminating Spammers and Ranking Annotators for Crowdsourced Labeling Tasks 2012

GPLP: A Local and Parallel Computation Toolbox for Gaussian Process Regression 2012

Query Strategies for Evading Convex-Inducing Classifiers 2012