Heads-up! Unsupervised Constituency Parsing via Self-Attention Heads

Bowen Li; Taeuk Kim; Reinald Kim Amplayo; Frank Keller

2020 AACL AACL 2020

Heads-up! Unsupervised Constituency Parsing via Self-Attention Heads

Abstract

AbstractTransformer-based pre-trained language models (PLMs) have dramatically improved the state of the art in NLP across many tasks. This has led to substantial interest in analyzing the syntactic knowledge PLMs learn. Previous approaches to this question have been limited, mostly using test suites or probes. Here, we propose a novel fully unsupervised parsing approach that extracts constituency trees from PLM attention heads. We rank transformer attention heads based on their inherent properties, and create an ensemble of high-ranking heads to produce the final tree. Our method is adaptable to low-resource languages, as it does not rely on development sets, which can be expensive to annotate. Our experiments show that the proposed method often outperform existing approaches if there is no development set present. Our unsupervised parser can also be used as a tool to analyze the grammars PLMs learn implicitly. For this, we use the parse trees induced by our method to train a neural PCFG and compare it to a grammar derived from a human-annotated treebank.

🚀 Conference Pioneer — AACL 2020

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — attention head

🐣 Hot Topic Early Bird — pre-trained language model

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Bowen Li , Taeuk Kim , Reinald Kim Amplayo , Frank Keller

Topics

Machine Learning > Learning Types > Unsupervised Learning Deep Learning > Architectures > Transformers Natural Language Processing > Understanding > Parsing

Keywords

unsupervised learning self-attention mechanism grammar induction constituency parsing attention head pre-trained language model

Download PDF

Related papers

Can Monolingual Pretrained Models Help Cross-Lingual Classification? 2020

Text Simplification with Reinforcement Learning Using Supervised Rewards on Grammaticality, Meaning Preservation, and Simplicity 2020

ISA: An Intelligent Shopping Assistant 2020

Social Media Medical Concept Normalization using RoBERTa in Ontology Enriched Text Similarity Framework 2020

Overcoming Resistance: The Normalization of an Amazonian Tribal Language 2020