Evaluating Attribution for Graph Neural Networks

Benjamin Sanchez-Lengeling; Jennifer Wei; Brian Lee; Emily Reif; Peter Wang; Wesley Qian; Kevin McCloskey; Lucy Colwell; Alexander Wiltschko

NeurIPS 2020

Abstract

Interpretability of machine learning models is critical to scientific understanding, AI safety, and debugging. Attribution is one approach to interpretability, which highlights input dimensions that are influential to a neural network’s prediction. Evaluation of these methods is largely qualitative for image and text models, because acquiring ground truth attributions requires expensive and unreliable human judgment. Attribution has been little studied for graph neural networks (GNNs), a model class of growing importance that makes predictions on arbitrarily sized graphs. In this work we adapt commonly used attribution methods for GNNs and quantitatively evaluate them using computable ground truths that are objective and challenging to learn. We make concrete recommendations for which attribution methods to use, and provide the data and code for our benchmarking suite. Rigorous and open source benchmarking of attribution methods in graphs could enable the development of new methods and broader use of attribution in real-world ML tasks.

Topics

Artificial Intelligence > Core AI > Interpretability
Deep Learning > Architectures > Graph Neural Networks
Machine Learning > Core Methods > Interpretability

Keywords

model benchmarking, ground truth, attribution method, graph neural network
