Please Mind the Root: Decoding Arborescences for Dependency Parsing

Ran Zmigrod; Tim Vieira; Ryan Cotterell

EMNLP 2020

Abstract

The connection between dependency trees and spanning trees is exploited by the NLP community to train and to decode graph-based dependency parsers. However, the NLP literature has missed an important difference between the two structures: only one edge may emanate from the root in a dependency tree. We analyzed the output of state-of-the-art parsers on many languages from the Universal Dependency Treebank: although these parsers are often able to learn that trees which violate the constraint should be assigned lower probabilities, their ability to do so unsurprisingly degrades as the size of the training set decreases. In fact, the worst constraint-violation rate we observe is 24%. Prior work has proposed an inefficient algorithm to enforce the constraint, which adds a factor of n to the decoding runtime. We adapt an algorithm due to Gabow and Tarjan (1984) to dependency parsing, which satisfies the constraint without compromising the original runtime.
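To make the root constraint concrete, here is a minimal Python sketch. It is not the authors' algorithm and not the Gabow and Tarjan (1984) method. It assumes a hypothetical encoding in which `heads[d - 1]` is the head of token `d`, index `0` is the artificial root, and `score[h][d]` is the weight of the edge from head `h` to dependent `d`. The decoder is an exhaustive search that is only usable on tiny inputs. The function `decode_single_root_naive` shows one way a factor of n can arise: it reruns an unconstrained decoder once per candidate root edge, with all other root edges masked out. Whether this matches the prior work mentioned in the abstract is an assumption.

```python
from itertools import product
from math import inf

ROOT = 0  # assumed index of the artificial root node


def root_edge_count(heads):
    """Number of edges emanating from the root in a head vector."""
    return sum(1 for h in heads if h == ROOT)


def violation_rate(parses):
    """Fraction of parses with more than one root edge."""
    return sum(root_edge_count(p) > 1 for p in parses) / len(parses)


def is_tree(heads):
    """True if every token reaches ROOT by following heads without a cycle."""
    for start in range(1, len(heads) + 1):
        seen, node = set(), start
        while node != ROOT:
            if node in seen:  # cycle (this also catches self-loops)
                return False
            seen.add(node)
            node = heads[node - 1]
    return True


def decode_brute_force(score):
    """Unconstrained decoding by exhaustive search (illustration only)."""
    n = len(score) - 1
    best_total, best_heads = -inf, None
    for heads in product(range(n + 1), repeat=n):
        if not is_tree(heads):
            continue
        total = sum(score[h][d] for d, h in enumerate(heads, start=1))
        if total > best_total:
            best_total, best_heads = total, heads
    return best_total, best_heads


def decode_single_root_naive(score):
    """Enforce a single root edge by calling the decoder once per candidate."""
    n = len(score) - 1
    best_total, best_heads = -inf, None
    for r in range(1, n + 1):
        # Keep only the root edge ROOT -> r; mask out every other root edge.
        masked = [row[:] for row in score]
        for d in range(1, n + 1):
            if d != r:
                masked[ROOT][d] = -inf
        total, heads = decode_brute_force(masked)
        if total > best_total:
            best_total, best_heads = total, heads
    return best_total, best_heads
```

With a score matrix whose root edges are all heavily weighted, `decode_brute_force` can return a tree with several root edges, while `decode_single_root_naive` returns the best tree with exactly one, at the cost of n decoder calls.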

Topics

Natural Language Processing > Understanding > Parsing; Mathematics & Optimization > Optimization > Combinatorial Optimization; Machine Learning > Core Methods > Graph Neural Networks

Keywords

dependency parsing; constraint satisfaction; spanning tree; constraint violation; dependency tree; graph parsing; graph parser; tree decoding
