A Lazy Man's Approach to Benchmarking: Semisupervised Classifier Evaluation and Recalibration

Peter Welinder; Max Welling; Pietro Perona

CVPR 2013

Abstract

How many labeled examples are needed to estimate a classifier's performance on a new dataset? We study the case where data is plentiful, but labels are expensive. We show that by making a few reasonable assumptions on the structure of the data, it is possible to estimate performance curves, with confidence bounds, using a small number of ground truth labels. Our approach, which we call Semisupervised Performance Evaluation (SPE), is based on a generative model for the classifier's confidence scores. In addition to estimating the performance of classifiers on new datasets, SPE can be used to recalibrate a classifier by reestimating the class-conditional confidence distributions.
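The idea in the abstract can be illustrated with a small sketch. The abstract only states that SPE uses a generative model for the classifier's confidence scores; the concrete choices below (a two-component Gaussian mixture, fitted with EM rather than any other inference procedure, and the hypothetical function names `fit_semisupervised_mixture` and `estimate_precision_recall`) are assumptions made for illustration and are not taken from the paper. The few labeled scores are clamped to their known class, the unlabeled scores receive soft class memberships, and those memberships are then used to estimate a precision-recall curve without further labels.

```python
import numpy as np


def norm_pdf(x, mu, sigma):
    """Gaussian density, evaluated elementwise."""
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2.0 * np.pi))


def fit_semisupervised_mixture(scores, labels, n_iter=200):
    """Fit a two-Gaussian mixture to classifier scores with EM.

    scores: (n,) classifier confidence scores.
    labels: (n,) ints, 1 = positive, 0 = negative, -1 = unlabeled.
            Assumes at least one labeled example of each class.
    Returns (pi, mu, sigma, r), where pi is the estimated positive
    fraction and r[i] is the estimated P(y_i = 1 | score_i).
    """
    scores = np.asarray(scores, dtype=float)
    labels = np.asarray(labels)
    known = labels >= 0

    # Initialise the class-conditional means from the labeled examples.
    mu = np.array([scores[labels == 0].mean(), scores[labels == 1].mean()])
    sigma = np.full(2, scores.std() + 1e-6)
    pi = 0.5

    for _ in range(n_iter):
        # E-step: posterior probability of the positive class.
        p1 = pi * norm_pdf(scores, mu[1], sigma[1])
        p0 = (1.0 - pi) * norm_pdf(scores, mu[0], sigma[0])
        r = p1 / (p0 + p1 + 1e-300)
        r[known] = labels[known]  # clamp labeled items to their true class

        # M-step: weighted updates of the mixture parameters.
        w = np.stack([1.0 - r, r])  # shape (2, n)
        pi = r.mean()
        mu = (w * scores).sum(axis=1) / w.sum(axis=1)
        var = (w * (scores - mu[:, None]) ** 2).sum(axis=1) / w.sum(axis=1)
        sigma = np.sqrt(var) + 1e-6

    return pi, mu, sigma, r


def estimate_precision_recall(scores, r, thresholds):
    """Estimate precision and recall from soft labels r instead of ground truth."""
    scores = np.asarray(scores, dtype=float)
    precision, recall = [], []
    for t in thresholds:
        predicted_pos = scores >= t
        expected_tp = r[predicted_pos].sum()
        precision.append(expected_tp / max(predicted_pos.sum(), 1))
        recall.append(expected_tp / r.sum())
    return np.array(precision), np.array(recall)
```

Under the same assumptions, the returned `r` values double as recalibrated scores: they are the posterior probability of the positive class given the raw confidence score. This sketch returns point estimates only; the confidence bounds mentioned in the abstract would need a treatment of parameter uncertainty that is not shown here.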

Topics

Machine Learning > Learning Types > Semi-Supervised Learning

Machine Learning > Optimization & Theory > Bayesian Inference

Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling

Machine Learning > Core Methods > Evaluation

Keywords

semi-supervised learning, probabilistic modeling, generative model, classifier calibration, classifier evaluation, performance estimation
