Expletives in Universal Dependency Treebanks

Gosse Bouma; Jan Hajic; Dag Haug; Joakim Nivre; Per Erik Solberg; Lilja Øvrelid

2018 EMNLP EMNLP 2018

Expletives in Universal Dependency Treebanks

Abstract

AbstractAlthough treebanks annotated according to the guidelines of Universal Dependencies (UD) now exist for many languages, the goal of annotating the same phenomena in a cross-linguistically consistent fashion is not always met. In this paper, we investigate one phenomenon where we believe such consistency is lacking, namely expletive elements. Such elements occupy a position that is structurally associated with a core argument (or sometimes an oblique dependent), yet are non-referential and semantically void. Many UD treebanks identify at least some elements as expletive, but the range of phenomena differs between treebanks, even for closely related languages, and sometimes even for different treebanks for the same language. In this paper, we present criteria for identifying expletives that are applicable across languages and compatible with the goals of UD, give an overview of expletives as found in current UD treebanks, and present recommendations for the annotation of expletives so that more consistent annotation can be achieved in future releases.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Interdisciplinary and Natural Language Processing

🧭 Keyword Pioneer — dependency treebank

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Robotics, Speech & Audio

Authors

Gosse Bouma , Jan Hajic , Dag Haug , Joakim Nivre , Per Erik Solberg , Lilja Øvrelid

Topics

Interdisciplinary > Linguistics > Computational Linguistics Interdisciplinary > Linguistics > Morphology Artificial Intelligence > Core AI > Language Natural Language Processing > Applications > Text Processing

Keywords

universal dependencies syntactic analysis cross-linguistic analysis treebank annotation syntactic annotation dependency treebank cross-linguistic annotation expletive element cross-linguistic consistency

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018