Rethinking Self-Attention: Towards Interpretability in Neural Parsing

Khalil Mrini; Franck Dernoncourt; Quan Hung Tran; Trung Bui; Walter Chang; Ndapa Nakashole

2020 EMNLP EMNLP 2020

Rethinking Self-Attention: Towards Interpretability in Neural Parsing

Abstract

AbstractAttention mechanisms have improved the performance of NLP tasks while allowing models to remain explainable. Self-attention is currently widely used, however interpretability is difficult due to the numerous attention distributions. Recent work has shown that model representations can benefit from label-specific information, while facilitating interpretation of predictions. We introduce the Label Attention Layer: a new form of self-attention where attention heads represent labels. We test our novel layer by running constituency and dependency parsing experiments and show our new model obtains new state-of-the-art results for both tasks on both the Penn Treebank (PTB) and Chinese Treebank. Additionally, our model requires fewer self-attention layers compared to existing work. Finally, we find that the Label Attention heads learn relations between syntactic categories and show pathways to analyze errors.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Khalil Mrini , Franck Dernoncourt , Quan Hung Tran , Trung Bui , Walter Chang , Ndapa Nakashole

Topics

Artificial Intelligence > Core AI > Interpretability Deep Learning > Architectures > Transformers Natural Language Processing > Understanding > Parsing Artificial Intelligence > Core AI > Attention Natural Language Processing > Applications > Parsing

Keywords

dependency parsing constituency parsing syntactic category neural network label attention

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020