Word and Document Embedding with vMF-Mixture Priors on Context Word Vectors

Shoaib Jameel; Steven Schockaert

ACL 2019

Abstract

Word embedding models typically learn two types of vectors: target word vectors and context word vectors. These vectors are normally learned such that they are predictive of some word co-occurrence statistic, but they are otherwise unconstrained. However, the words from a given language can be organized in various natural groupings, such as syntactic word classes (e.g. nouns, adjectives, verbs) and semantic themes (e.g. sports, politics, sentiment). Our hypothesis in this paper is that embedding models can be improved by explicitly imposing a cluster structure on the set of context word vectors. To this end, our model relies on the assumption that context word vectors are drawn from a mixture of von Mises-Fisher (vMF) distributions, where the parameters of this mixture distribution are jointly optimized with the word vectors. We show that this results in word vectors which are qualitatively different from those obtained with existing word embedding models. We furthermore show that our embedding model can also be used to learn high-quality document representations.
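
To make the mixture assumption concrete, the following is a minimal sketch of how a unit-length context vector could be scored under a mixture of vMF components and softly assigned to clusters. It is an illustration only and not the authors' implementation: it assumes 3-dimensional vectors so that the vMF normaliser has the closed form kappa / (4 pi sinh(kappa)), and the function names, toy mean directions, concentrations and weights are all hypothetical.

```python
import math


def normalize(v):
    """Scale a vector to unit length (the vMF distribution lives on the unit sphere)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def vmf_log_density_3d(x, mu, kappa):
    """Log density of a 3-dimensional vMF distribution at unit vector x.

    Assumes kappa > 0 and uses the closed-form normaliser
    C_3(kappa) = kappa / (4 * pi * sinh(kappa)), valid only for dimension 3.
    """
    # log(sinh(kappa)) written as kappa + log(1 - exp(-2*kappa)) - log(2)
    # to avoid overflow for large kappa.
    log_sinh = kappa + math.log1p(-math.exp(-2.0 * kappa)) - math.log(2.0)
    log_norm = math.log(kappa) - math.log(4.0 * math.pi) - log_sinh
    dot = sum(a * b for a, b in zip(x, mu))
    return log_norm + kappa * dot


def mixture_responsibilities(x, weights, mus, kappas):
    """Posterior probability of each mixture component given the vector x."""
    log_terms = [
        math.log(w) + vmf_log_density_3d(x, m, k)
        for w, m, k in zip(weights, mus, kappas)
    ]
    top = max(log_terms)  # subtract the maximum before exponentiating (log-sum-exp)
    exps = [math.exp(t - top) for t in log_terms]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical toy setup: two clusters with orthogonal mean directions.
mus = [normalize([1.0, 0.0, 0.0]), normalize([0.0, 1.0, 0.0])]
kappas = [20.0, 20.0]
weights = [0.5, 0.5]

# A context vector pointing mostly along the first mean direction.
context_vec = normalize([0.9, 0.1, 0.0])
resp = mixture_responsibilities(context_vec, weights, mus, kappas)
print(resp)
```

In realistic embedding dimensions the normaliser involves a modified Bessel function of the first kind rather than sinh, and in the model described above the mixture parameters would be optimized jointly with the word vectors rather than fixed by hand as they are here.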

Topics

Machine Learning > Core Methods > Clustering
Machine Learning > Core Methods > Embedding Learning
Machine Learning > Learning Types > Unsupervised Learning
Natural Language Processing > Resources & Methods > Text Representation
Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling
Natural Language Processing > Resources & Methods > Language Modeling
Deep Learning > Learning Types > Representation Learning

Keywords

document representation, mixture model, semantic clustering, word embedding, von Mises-Fisher distribution, context vector, context word vector
