The Importance of Being Recurrent for Modeling Hierarchical Structure

Ke Tran; Arianna Bisazza; Christof Monz

2018 EMNLP EMNLP 2018

The Importance of Being Recurrent for Modeling Hierarchical Structure

Abstract

AbstractRecent work has shown that recurrent neural networks (RNNs) can implicitly capture and exploit hierarchical information when trained to solve common natural language processing tasks (Blevins et al., 2018) such as language modeling (Linzen et al., 2016; Gulordava et al., 2018) and neural machine translation (Shi et al., 2016). In contrast, the ability to model structured data with non-recurrent neural networks has received little attention despite their success in many NLP tasks (Gehring et al., 2017; Vaswani et al., 2017). In this work, we compare the two architectures—recurrent versus non-recurrent—with respect to their ability to model hierarchical structure and find that recurrency is indeed important for this purpose. The code and data used in our experiments is available at https://github.com/ketranm/fan_vs_rnn

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — non-recurrent neural network

🐣 Hot Topic Early Bird — hierarchical structure

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ke Tran , Arianna Bisazza , Christof Monz

Topics

Deep Learning > Architectures > Neural Networks Natural Language Processing > Generation > Language Modeling Natural Language Processing > Applications > Machine Translation Deep Learning > Learning Types > Representation Learning Artificial Intelligence > Core AI > Natural Language Processing Deep Learning > Architectures > Recurrent Neural Networks

Keywords

neural machine translation language modeling hierarchical structure non-recurrent neural network

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018