Interpreting Word-Level Hidden State Behaviour of Character-Level LSTM Language Models

Avery Hiebert; Cole Peterson; Alona Fyshe; Nishant Mehta

2018 EMNLP EMNLP 2018

Interpreting Word-Level Hidden State Behaviour of Character-Level LSTM Language Models

Abstract

AbstractWhile Long Short-Term Memory networks (LSTMs) and other forms of recurrent neural network have been successfully applied to language modeling on a character level, the hidden state dynamics of these models can be difficult to interpret. We investigate the hidden states of such a model by using the HDBSCAN clustering algorithm to identify points in the text at which the hidden state is similar. Focusing on whitespace characters prior to the beginning of a word reveals interpretable clusters that offer insight into how the LSTM may combine contextual and character-level information to identify parts of speech. We also introduce a method for deriving word vectors from the hidden state representation in order to investigate the word-level knowledge of the model. These word vectors encode meaningful semantic information even for words that appear only once in the training text.

🌱 Topic Pioneer — Interpretability

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

📈 Trend Setter — Interpretability

🐣 Hot Topic Early Bird — hidden state

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Avery Hiebert , Cole Peterson , Alona Fyshe , Nishant Mehta

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Core Methods > Clustering Machine Learning > Core Methods > Representation Learning Artificial Intelligence > Core AI > Language Deep Learning > Learning Types > Representation Learning Deep Learning > Techniques > Interpretability

Keywords

representation learning clustering algorithm hidden state long short-term memory recurrent neural network word embedding character-level model

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018