Deep RNNs Encode Soft Hierarchical Syntax

Terra Blevins; Omer Levy; Luke Zettlemoyer

2018 ACL ACL 2018

Deep RNNs Encode Soft Hierarchical Syntax

Abstract

AbstractWe present a set of experiments to demonstrate that deep recurrent neural networks (RNNs) learn internal representations that capture soft hierarchical notions of syntax from highly varied supervision. We consider four syntax tasks at different depths of the parse tree; for each word, we predict its part of speech as well as the first (parent), second (grandparent) and third level (great-grandparent) constituent labels that appear above it. These predictions are made from representations produced at different depths in networks that are pretrained with one of four objectives: dependency parsing, semantic role labeling, machine translation, or language modeling. In every case, we find a correspondence between network depth and syntactic depth, suggesting that a soft syntactic hierarchy emerges. This effect is robust across all conditions, indicating that the models encode significant amounts of syntax even in the absence of an explicit syntactic training supervision.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — syntactic depth

🐣 Hot Topic Early Bird — hierarchical structure

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Terra Blevins , Omer Levy , Luke Zettlemoyer

Topics

Machine Learning > Core Methods > Representation Learning Natural Language Processing > Understanding > Part-of-Speech Tagging Natural Language Processing > Understanding > Syntax Deep Learning > Architectures > Recurrent Neural Networks

Keywords

representation learning syntactic parsing part-of-speech tagging hierarchical structure recurrent neural network internal representation parse tree syntactic depth

Download PDF

Related papers

Economic Event Detection in Company-Specific News Text 2018

Investigating Effective Parameters for Fine-tuning of Word Embeddings Using Only a Small Corpus 2018

SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment 2018

Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer 2018

Affordances in Grounded Language Learning 2018