Human-grounded Evaluations of Explanation Methods for Text Classification

Piyawat Lertvittayakumjorn; Francesca Toni

2019 EMNLP EMNLP 2019

Human-grounded Evaluations of Explanation Methods for Text Classification

Abstract

AbstractDue to the black-box nature of deep learning models, methods for explaining the models’ results are crucial to gain trust from humans and support collaboration between AIs and humans. In this paper, we consider several model-agnostic and model-specific explanation methods for CNNs for text classification and conduct three human-grounded evaluations, focusing on different purposes of explanations: (1) revealing model behavior, (2) justifying model predictions, and (3) helping humans investigate uncertain predictions. The results highlight dissimilar qualities of the various explanation methods we consider and show the degree to which these methods could serve for each purpose.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — model behavior

🐣 Hot Topic Early Bird — human evaluation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Piyawat Lertvittayakumjorn , Francesca Toni

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Application Areas > Fairness Natural Language Processing > Understanding > Semantic Analysis Natural Language Processing > Applications > Text Classification Machine Learning > Core Methods > Interpretability

Keywords

text classification semantic analysis model behavior convolutional neural network human evaluation explainable artificial intelligence explanation method model-agnostic explanation

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

Iterative Dual Domain Adaptation for Neural Machine Translation 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019