Colorless Green Recurrent Networks Dream Hierarchically

Kristina Gulordava; Piotr Bojanowski; Edouard Grave; Tal Linzen; Marco Baroni

NAACL 2018

Abstract

Recurrent neural networks (RNNs) have achieved impressive results in a variety of linguistic processing tasks, suggesting that they can induce non-trivial properties of language. We investigate to what extent RNNs learn to track abstract hierarchical syntactic structure. We test whether RNNs trained with a generic language modeling objective in four languages (Italian, English, Hebrew, Russian) can predict long-distance number agreement in various constructions. We include in our evaluation nonsensical sentences where RNNs cannot rely on semantic or lexical cues (“The colorless green ideas I ate with the chair sleep furiously”), and, for Italian, we compare model performance to human intuitions. Our language-model-trained RNNs make reliable predictions about long-distance agreement, and do not lag much behind human performance. We thus bring support to the hypothesis that RNNs are not just shallow-pattern extractors, but they also acquire deeper grammatical competence.
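The abstract does not spell out how an agreement prediction is scored, so the following is only a minimal sketch under a common assumption: a language model "predicts agreement" on an item when it assigns a higher log-probability to the correctly inflected verb than to the incorrectly inflected one, given the same sentence prefix. The names `agreement_accuracy`, `AgreementItem`, and `toy_log_prob` are hypothetical, and `toy_log_prob` is a deliberately shallow stand-in (it agrees with the nearest word), not a trained RNN.

```python
from typing import Callable, Iterable, Tuple

# One test item: (sentence prefix, correctly inflected verb, incorrectly inflected verb).
AgreementItem = Tuple[str, str, str]


def agreement_accuracy(
    items: Iterable[AgreementItem],
    log_prob: Callable[[str, str], float],
) -> float:
    """Fraction of items where the scorer prefers the correct verb form.

    `log_prob(prefix, word)` is assumed to return the model's
    log-probability of `word` as the next token after `prefix`.
    """
    items = list(items)
    if not items:
        raise ValueError("need at least one agreement item")
    correct = sum(
        1 for prefix, good, bad in items
        if log_prob(prefix, good) > log_prob(prefix, bad)
    )
    return correct / len(items)


def toy_log_prob(prefix: str, word: str) -> float:
    """Hypothetical shallow baseline, NOT a real language model.

    It treats a prefix-final word ending in 's' as a plural noun and an
    English verb without final 's' as plural, then rewards a match.
    """
    last_is_plural = prefix.split()[-1].endswith("s")
    verb_is_plural = not word.endswith("s")
    return 0.0 if last_is_plural == verb_is_plural else -1.0


ITEMS = [
    # Long-distance item built from the abstract's nonsense sentence:
    # the subject "ideas" is plural, the intervening noun "chair" is singular.
    ("The colorless green ideas I ate with the chair", "sleep", "sleeps"),
    # Adjacent subject and verb: no intervening noun to mislead the baseline.
    ("The green ideas", "sleep", "sleeps"),
]

print(agreement_accuracy(ITEMS, toy_log_prob))
```

In this sketch the nearest-word baseline gets the adjacent item right and the long-distance item wrong, which illustrates why long-distance agreement with an intervening noun is a useful probe for whether a model tracks structure rather than surface proximity. To evaluate an actual model, `toy_log_prob` would be replaced by a function that queries that model.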

Topics

Deep Learning > Architectures > Neural Networks
Natural Language Processing > Generation > Language Modeling
Interdisciplinary > Linguistics > Computational Linguistics
Artificial Intelligence > Core AI > Large Language Models
Deep Learning > Models > Neural Networks
Artificial Intelligence > Core AI > Language

Keywords

language modeling, hierarchical structure, recurrent neural network, language model, syntactic structure, number agreement, grammatical competence, long-distance agreement
