Improved Dependency Parsing using Implicit Word Connections Learned from Unlabeled Data

Wenhui Wang; Baobao Chang; Mairgup Mansur

2018 EMNLP EMNLP 2018

Improved Dependency Parsing using Implicit Word Connections Learned from Unlabeled Data

Abstract

AbstractPre-trained word embeddings and language model have been shown useful in a lot of tasks. However, both of them cannot directly capture word connections in a sentence, which is important for dependency parsing given its goal is to establish dependency relations between words. In this paper, we propose to implicitly capture word connections from unlabeled data by a word ordering model with self-attention mechanism. Experiments show that these implicit word connections do improve our parsing model. Furthermore, by combining with a pre-trained language model, our model gets state-of-the-art performance on the English PTB dataset, achieving 96.35% UAS and 95.25% LAS.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — pre-trained language model

🐣 Hot Topic Early Bird — pre-trained language model

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Wenhui Wang , Baobao Chang , Mairgup Mansur

Topics

Machine Learning > Core Methods > Representation Learning Deep Learning > Architectures > Transformers Natural Language Processing > Understanding > Parsing Deep Learning > Techniques > Attention Natural Language Processing > Applications > Parsing

Keywords

self-attention mechanism syntactic parsing dependency parsing pre-trained language model word embedding word ordering word connection

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018