IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding

Peijian Gu; Yaozong Shen; Lijie Wang; Quan Wang; Hua Wu; Zhendong Mao

2023 EMNLP EMNLP 2023

IAEval: A Comprehensive Evaluation of Instance Attribution on Natural Language Understanding

Abstract

AbstractInstance attribution (IA) aims to identify the training instances leading to the prediction of a test example, helping researchers understand the dataset better and optimize data processing. While many IA methods have been proposed recently, how to evaluate them still remains open. Previous evaluations of IA only focus on one or two dimensions and are not comprehensive. In this work, we introduce IAEval for IA methods, a systematic and comprehensive evaluation scheme covering four significant requirements: sufficiency, completeness, stability and plausibility. We elaborately design novel metrics to measure these requirements for the first time. Three representative IA methods are evaluated under IAEval on four natural language understanding datasets. Extensive experiments confirmed the effectiveness of IAEval and exhibited its ability to provide comprehensive comparison among IA methods. With IAEval, researchers can choose the most suitable IA methods for applications like model debugging.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Peijian Gu , Yaozong Shen , Lijie Wang , Quan Wang , Hua Wu , Zhendong Mao

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Application Areas > Fairness Natural Language Processing > Understanding Machine Learning > Optimization & Theory > Evaluation Machine Learning > Core Methods > Interpretability Natural Language Processing > Applications > Natural Language Understanding

Keywords

natural language understanding model interpretability model debugging evaluation metric evaluation metrics instance attribution dataset optimization dataset debugging

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023