What do RNN Language Models Learn about Filler–Gap Dependencies?

Ethan Wilcox; Roger Levy; Takashi Morita; Richard Futrell

EMNLP 2018

Abstract

RNN language models have achieved state-of-the-art perplexity results and have proven useful in a suite of NLP tasks, but it is as yet unclear what syntactic generalizations they learn. Here we investigate whether state-of-the-art RNN language models represent long-distance filler–gap dependencies and constraints on them. Examining RNN behavior on experimentally controlled sentences designed to expose filler–gap dependencies, we show that RNNs can represent the relationship in multiple syntactic positions and over large spans of text. Furthermore, we show that RNNs learn a subset of the known restrictions on filler–gap dependencies, known as island constraints: RNNs show evidence for wh-islands, adjunct islands, and complex NP islands. These studies demonstrate that state-of-the-art RNN models are able to learn and generalize about empty syntactic positions.

Topics

Artificial Intelligence > Core AI > Foundation Models; Interdisciplinary > Linguistics > Computational Linguistics; Artificial Intelligence > Core AI > Language; Deep Learning > Learning Types > Representation Learning; Deep Learning > Models > Language Models

Keywords

recurrent neural network language model; syntactic generalization; filler-gap dependency; island constraint
