2023
AAAI
AAAI 2023
Improving Interpretability via Explicit Word Interaction Graph Layer
Abstract
Abstract Recent NLP literature has seen growing interest in improving model interpretability. Along this direction, we propose a trainable neural network layer that learns a global interaction graph between words and then selects more informative words using the learned word interactions. Our layer, we call WIGRAPH, can plug into any neural network-based NLP text classifiers right after its word embedding layer. Across multiple SOTA NLP models and various NLP datasets, we demonstrate that adding the WIGRAPH layer substantially improves NLP models' interpretability and enhances models' prediction performance at the same time.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing
📈
Trend Setter
— Interpretability
🧭
Keyword Pioneer
— word interaction graph
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Interpretability
Machine Learning > Core Methods > Representation Learning
Deep Learning > Architectures > Graph Neural Networks
Natural Language Processing > Applications > Text Classification
Machine Learning > Core Methods > Feature Learning
Natural Language Processing > Applications > Natural Language Understanding
Deep Learning > Optimization & Theory > Interpretability