State Gradients for RNN Memory Analysis

Lyan Verwimp; Hugo Van hamme; Vincent Renkens; Patrick Wambacq

EMNLP 2018

Abstract

We present a framework for analyzing what the state in RNNs remembers from its input embeddings. We compute the gradients of the states with respect to the input embeddings and decompose the gradient matrix with Singular Value Decomposition to analyze which directions in the embedding space are best transferred to the hidden state space, characterized by the largest singular values. We apply our approach to LSTM language models and investigate to what extent and for how long certain classes of words are remembered on average for a certain corpus. Additionally, the extent to which a specific property or relationship is remembered by the RNN can be tracked by comparing a vector characterizing that property with the direction(s) in embedding space that are best preserved in hidden state space.
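The procedure described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a plain tanh RNN in place of the LSTM language models used in the paper, random weights instead of a trained model, and a single input sequence rather than an average over a corpus. The names `state_gradient`, `property_vec`, and `alignment` are hypothetical and introduced only for illustration. The sketch computes the gradient matrix of a later hidden state with respect to an earlier input embedding, decomposes it with SVD, and compares the best-preserved embedding direction with a property vector.

```python
import numpy as np

def state_gradient(W, U, b, xs, t, delay):
    """Jacobian of h_{t+delay} w.r.t. the input embedding x_t.

    Assumed toy model (not the paper's LSTM):
        h_k = tanh(W x_k + U h_{k-1} + b), with h_{-1} = 0.
    """
    h = np.zeros(U.shape[0])
    jac = None
    for k, x in enumerate(xs[: t + delay + 1]):
        h = np.tanh(W @ x + U @ h + b)
        d = 1.0 - h ** 2  # tanh derivative at this step
        if k == t:
            jac = d[:, None] * W          # dh_t / dx_t
        elif k > t:
            jac = (d[:, None] * U) @ jac  # chain rule through one time step
    return jac

# Random toy parameters and inputs (assumptions, not trained weights).
rng = np.random.default_rng(0)
emb_dim, hid_dim, seq_len = 8, 16, 6
W = rng.normal(scale=0.5, size=(hid_dim, emb_dim))
U = rng.normal(scale=0.5, size=(hid_dim, hid_dim))
b = np.zeros(hid_dim)
xs = rng.normal(size=(seq_len, emb_dim))

# Gradient of the state 3 steps after the embedding at position 1.
G = state_gradient(W, U, b, xs, t=1, delay=3)

# Rows of Vt are directions in embedding space; the singular values
# measure how strongly each direction is carried into the hidden state.
_, svals, Vt = np.linalg.svd(G, full_matrices=False)

# Hypothetical unit vector characterizing some property of interest.
property_vec = rng.normal(size=emb_dim)
property_vec /= np.linalg.norm(property_vec)

# Absolute cosine between the property vector and the best-preserved direction.
alignment = abs(Vt[0] @ property_vec)
```

Repeating this for increasing values of `delay` and watching how `svals` shrinks would give a rough picture of how long an input is remembered in this toy setting. For an LSTM, the Jacobian would more practically come from an automatic differentiation library than from a hand-written chain rule.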

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Optimization & Theory > Theory
Natural Language Processing > Generation > Language Modeling
Machine Learning > Optimization & Theory > Representation Learning
Deep Learning > Architectures > Recurrent Neural Networks

Keywords

embedding space, singular value decomposition, hidden state, recurrent neural network, language model, gradient analysis, state gradient, RNN memory, memory analysis

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018