Towards Trustworthy Predictions from Deep Neural Networks with Fast Adversarial Calibration

Christian Tomani; Florian Buettner

AAAI 2021

Abstract

To facilitate widespread acceptance of AI systems guiding decision making in real-world applications, trustworthiness of deployed models is key. That is, it is crucial for predictive models to be uncertainty-aware and yield well-calibrated (and thus trustworthy) predictions both for in-domain samples and under domain shift. Recent efforts to account for predictive uncertainty include post-processing steps for trained neural networks, Bayesian neural networks, and alternative non-Bayesian approaches such as ensemble approaches and evidential deep learning. Here, we propose an efficient yet general modelling approach for obtaining well-calibrated, trustworthy probabilities for samples obtained after a domain shift. We introduce a new training strategy combining an entropy-encouraging loss term with an adversarial calibration loss term and demonstrate that this results in well-calibrated and technically trustworthy predictions for a wide range of domain drifts. We comprehensively evaluate previously proposed approaches on different data modalities and a large range of data sets (including sequence data), network architectures, and perturbation strategies. We observe that our modelling approach substantially outperforms existing state-of-the-art approaches, yielding well-calibrated predictions under domain drift.
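To make "well-calibrated" concrete, the sketch below computes the expected calibration error (ECE), a widely used summary of the gap between a model's confidence and its accuracy. The abstract does not name a metric, so the choice of ECE is an assumption here, as are the function name `expected_calibration_error`, the equal-width binning, and the default of 10 bins. It is an illustrative sketch, not the authors' evaluation code.

```python
from typing import Sequence


def expected_calibration_error(
    confidences: Sequence[float],
    correct: Sequence[bool],
    n_bins: int = 10,
) -> float:
    """Bin-weighted average of |accuracy - mean confidence| (hypothetical helper).

    confidences: predicted probability of the chosen class, each in [0, 1].
    correct: whether each corresponding prediction was right.
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        raise ValueError("at least one prediction is required")

    # Equal-width bins [k/n_bins, (k+1)/n_bins); a confidence of 1.0 goes in the last bin.
    bins = [[] for _ in range(n_bins)]
    for conf, hit in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, hit))

    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(1 for _, h in members if h) / len(members)
        # Weight each bin's confidence/accuracy gap by its share of the samples.
        ece += (len(members) / n) * abs(accuracy - mean_conf)
    return ece


# Overconfident toy model: 95% confidence, but only half the predictions are right.
overconfident = expected_calibration_error([0.95] * 4, [True, True, False, False])
# Calibrated toy model: 75% confidence and 3 of 4 predictions are right.
calibrated = expected_calibration_error([0.75] * 4, [True, True, True, False])
print(round(overconfident, 3), round(calibrated, 3))
```

A lower value indicates confidence that tracks accuracy more closely. In the domain-shift setting described above, one would compare this quantity on in-domain data and on increasingly perturbed data.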

Topics

Machine Learning > Learning Types > Adversarial Learning
Machine Learning > Optimization & Theory > Loss Functions
Machine Learning > Application Areas > Fairness
Deep Learning > Techniques > Model Architecture
Deep Learning > Optimization & Theory > Neural Network Optimization
Deep Learning > Learning Types > Adversarial Learning
Machine Learning > Learning Types > Uncertainty Quantification

Keywords

uncertainty quantification; model calibration; deep neural network; domain shift; predictive uncertainty; neural network; adversarial calibration; entropy-encouraging loss; entropy loss; trustworthy prediction
