Confidence Scoring Using Whitebox Meta-models with Linear Classifier Probes

Tongfei Chen; Jiri Navratil; Vijay Iyengar; Karthikeyan Shanmugam

2019 AISTATS AISTATS 2019

Confidence Scoring Using Whitebox Meta-models with Linear Classifier Probes

Abstract

We propose a novel confidence scoring mechanism for deep neural networks based on a two-model paradigm involving a base model and a meta-model. The confidence score is learned by the meta-model observing the base model succeeding/failing at its task. As features to the meta-model, we investigate linear classifier probes inserted between the various layers of the base model. Our experiments demonstrate that this approach outperforms multiple baselines in a filtering task, i.e., task of rejecting samples with low confidence. Experimental results are presented using CIFAR-10 and CIFAR-100 dataset with and without added noise. We discuss the importance of confidence scoring to bridge the gap between experimental and real-world applications.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning

🧭 Keyword Pioneer — linear classifier probe

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Tongfei Chen , Jiri Navratil , Vijay Iyengar , Karthikeyan Shanmugam

Topics

Artificial Intelligence > Core AI > Interpretability Artificial Intelligence > Learning Paradigms > Meta-Learning Deep Learning > Techniques > Model Architecture Computer Vision > Analysis > Anomaly Detection Machine Learning > Learning Paradigms > Meta-Learning Deep Learning > Learning Types > Meta-Learning

Keywords

early exit convolutional neural network deep neural network confidence scoring rejection sampling linear classifier probe

Download PDF

Related papers

Inferring Multidimensional Rates of Aging from Cross-Sectional Data 2019

On the Interaction Effects Between Prediction and Clustering 2019

Efficient Linear Bandits through Matrix Sketching 2019

An Optimal Algorithm for Stochastic Three-Composite Optimization 2019

Efficient Inference in Multi-task Cox Process Models 2019