Neyman-Pearson classification under a strict constraint

Philippe Rigollet; Xin Tong

COLT 2011

Abstract

Motivated by problems of anomaly detection, this paper implements the Neyman-Pearson paradigm to deal with asymmetric errors in binary classification with a convex loss. Given a finite collection of classifiers, we combine them and obtain a new classifier that, with high probability, simultaneously satisfies the following two properties: (i) its probability of type I error is below a pre-specified level, and (ii) its probability of type II error is close to the minimum possible. The proposed classifier is obtained by minimizing an empirical objective subject to an empirical constraint. The novelty of the method is that the classifier output by this problem is shown to satisfy the original constraint on type I error. This strict enforcement of the constraint has interesting consequences for the control of the type II error, and we develop new techniques to handle this situation. Finally, connections with chance-constrained optimization are evident and are investigated.
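As a rough illustration of "minimizing an empirical objective subject to an empirical constraint", the following Python sketch combines two base classifiers by a convex weight and picks the weight with the smallest empirical type II hinge risk among those whose empirical type I hinge risk stays below a tightened level. Several things here are assumptions for illustration and are not taken from the abstract: the hinge loss as the convex loss, the restriction to two base classifiers, the grid search over the weight, the function names, and the `slack` parameter that tightens the empirical constraint so that the true constraint has a chance of holding.

```python
def hinge(u):
    """Convex surrogate for the indicator of u >= 0 (assumed choice of loss)."""
    return max(0.0, 1.0 + u)


def empirical_risks(lam, neg_scores, pos_scores):
    """Empirical hinge risks of the convex combination lam*h1 + (1-lam)*h2.

    neg_scores / pos_scores hold pairs (h1(x), h2(x)) for samples from the
    negative (label -1) and positive (label +1) class respectively.
    """
    def combine(s):
        return lam * s[0] + (1.0 - lam) * s[1]

    # Type I: a negative sample is pushed towards the positive side.
    r0 = sum(hinge(combine(s)) for s in neg_scores) / len(neg_scores)
    # Type II: a positive sample is pushed towards the negative side.
    r1 = sum(hinge(-combine(s)) for s in pos_scores) / len(pos_scores)
    return r0, r1


def np_combine(neg_scores, pos_scores, alpha, slack, grid_size=101):
    """Grid search over the weight lam in [0, 1].

    Returns (lam, type_II_risk, type_I_risk) for the feasible weight with the
    smallest empirical type II risk, or None if no grid point satisfies the
    tightened empirical constraint r0 <= alpha - slack.
    """
    best = None
    for k in range(grid_size):
        lam = k / (grid_size - 1)
        r0, r1 = empirical_risks(lam, neg_scores, pos_scores)
        if r0 <= alpha - slack and (best is None or r1 < best[1]):
            best = (lam, r1, r0)
    return best


# Toy data (hypothetical): h1 is conservative, h2 is aggressive.
neg = [(-1.0, 0.0), (-1.0, 0.0)]
pos = [(0.0, 1.0), (0.0, 1.0)]
result = np_combine(neg, pos, alpha=0.5, slack=0.125, grid_size=9)
```

On this toy data the empirical type I risk equals 1 - lam and the empirical type II risk equals lam, so the constraint forces lam >= 0.625 and the search returns that boundary weight. A larger `slack` makes the constraint stricter and raises the achievable type II risk, which is the trade-off the abstract alludes to.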

Topics

Machine Learning > Core Methods > Classification
Machine Learning > Optimization & Theory > Statistical Learning
Machine Learning > Application Areas > Risk Management
Mathematics & Optimization > Optimization > Stochastic Methods
Machine Learning > Learning Types > Classification

Keywords

binary classification, anomaly detection, convex loss, type I error, type II error, Neyman-Pearson classification, asymmetric error, chance constrained optimization
