Analysing Word Representation from the Input and Output Embeddings in Neural Network Language Models

Steven Derby; Paul Miller; Barry Devereux

EMNLP 2020

Abstract

Researchers have recently demonstrated that tying the neural weights between the input look-up table and the output classification layer can improve training and lower perplexity on sequence learning tasks such as language modelling. Such a procedure is possible due to the design of the softmax classification layer, which previous work has shown to comprise a viable set of semantic representations for the model vocabulary; these output embeddings are known to perform well on word similarity benchmarks. In this paper, we make meaningful comparisons between the input and output embeddings and other state-of-the-art (SOTA) distributional models to gain a better understanding of the types of information they represent. We also construct a new set of word embeddings, using the output embeddings to create locally-optimal approximations for the intermediate representations from the language model. These locally-optimal embeddings demonstrate excellent performance across all our evaluations.
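The weight tying mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy sizes, the random matrix `E`, and the helper names `input_embedding`, `output_distribution`, and `cosine` are all assumptions made for illustration. The point is only that one matrix can serve both as the input look-up table and as the softmax classification weights, so each row is at once a word's input and output embedding.

```python
import numpy as np

# Toy sizes, chosen only for illustration (assumption, not from the paper).
rng = np.random.default_rng(0)
vocab_size, dim = 5, 3

# A single shared matrix: row i is used both as the input embedding of
# word i and as the softmax (output) weight vector for word i.
E = rng.normal(size=(vocab_size, dim))

def input_embedding(word_id):
    """Look up the input embedding for a word index."""
    return E[word_id]

def output_distribution(hidden):
    """Softmax over the vocabulary using the same matrix as output weights."""
    logits = E @ hidden
    logits = logits - logits.max()  # subtract the max for numerical stability
    probs = np.exp(logits)
    return probs / probs.sum()

def cosine(u, v):
    """Cosine similarity, the usual score on word similarity benchmarks."""
    return float(u @ v / (np.linalg.norm(u) * np.linalg.norm(v)))

# Stand-in for a hidden state; a real language model would compute this
# from the preceding context.
hidden = input_embedding(2)
probs = output_distribution(hidden)
similarity = cosine(E[0], E[1])
```

In an untied model there would be two separate matrices of the same shape, which is what makes it possible to compare input and output embeddings as distinct word representations.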

Topics

Machine Learning > Core Methods > Embedding Learning
Deep Learning > Architectures > Neural Networks
Natural Language Processing > Generation > Language Modeling
Natural Language Processing > Resources & Methods > Language Modeling
Deep Learning > Learning Types > Representation Learning
Deep Learning > Models > Language Models

Keywords

language model, word embedding, word similarity, neural language model, embedding analysis, input embedding, output embedding, neural network, softmax classification
