Surrogate regret bounds for generalized classification performance metrics

Wojciech Kotlowski; Krzysztof Dembczynski

2015 ACML ACML 2015

Surrogate regret bounds for generalized classification performance metrics

Abstract

We consider optimization of generalized performance metrics for binary classification by means of surrogate loss. We focus on a class of metrics, which are linear-fractional functions of the false positive and false negative rates (examples of which include $F_\\beta$-measure, Jaccard similarity coefficient, AM measure, and many others). Our analysis concerns the following two-step procedure. First, a real-valued function $f$ is learned by minimizing a surrogate loss for binary classification on the training sample. It is assumed that the surrogate loss is a strongly proper composite loss function (examples of which include logistic loss, squared-error loss, exponential loss, etc.). Then, given $f$, a threshold $\\hat{\\theta}$ is tuned on a separate validation sample, by direct optimization of the target performance measure. We show that the regret of the resulting classifier (obtained from thresholding $f$ on $\\hat{\\theta}$ measured with respect to the target metric is upperbounded by the regret of f measured with respect to the surrogate loss. Our finding is further analyzed in a computational study on both synthetic and real data sets.

📈 Trend Setter — Loss Functions

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

🐣 Hot Topic Early Bird — binary classification

Authors

Wojciech Kotlowski , Krzysztof Dembczynski

Topics

Machine Learning > Core Methods > Classification Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Optimization & Theory > Loss Functions

Keywords

binary classification performance metric surrogate loss regret bound loss function

Download PDF

Related papers

Continuous Target Shift Adaptation in Supervised Learning 2015

Statistical Unfolded Logic Learning 2015

Integration of Single-view Graphs with Diffusion of Tensor Product Graphs for Multi-view Spectral Clustering 2015

Class-prior Estimation for Learning from Positive and Unlabeled Data 2015

Expectation Propagation for Rectified Linear Poisson Regression 2015