Towards Benchmarking the Utility of Explanations for Model Debugging

Maximilian Idahl; Lijun Lyu; Ujwal Gadiraju; Avishek Anand

NAACL 2021

Abstract

Post-hoc explanation methods are an important class of approaches that help understand the rationale underlying a trained model’s decision. But how useful are they to an end-user in accomplishing a given task? In this vision paper, we argue the need for a benchmark to facilitate evaluations of the utility of post-hoc explanation methods. As a first step to this end, we enumerate desirable properties that such a benchmark should possess for the task of debugging text classifiers. Additionally, we highlight that such a benchmark facilitates not only assessing the effectiveness of explanations but also their efficiency.

Topics

Artificial Intelligence > Core AI > Interpretability

Machine Learning > Learning Types > Weakly Supervised Learning

Natural Language Processing > Applications > Text Classification

Machine Learning > Optimization & Theory > Evaluation

Keywords

benchmark evaluation; model debugging; post-hoc explanation; explanation method; text classifier; explanation utility
