2012
NIPS
NeurIPS 2012
Factorial LDA: Sparse Multi-Dimensional Text Models
Abstract
Multi-dimensional latent variable models can capture the many latent factors in a text corpus, such as topic, author perspective and sentiment. We introduce factorial LDA, a multi-dimensional latent variable model in which a document is influenced by K different factors, and each word token depends on a K-dimensional vector of latent variables. Our model incorporates structured word priors and learns a sparse product of factors. Experiments on research abstracts show that our model can learn latent factors such as research topic, scientific discipline, and focus (e.g. methods vs. applications.) Our modeling improvements reduce test perplexity and improve human interpretability of the discovered factors.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Machine Learning and Natural Language Processing
📈
Trend Setter
— Text Representation
🧭
Keyword Pioneer
— multi-dimensional models
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning
🐣
Hot Topic Early Bird
— topic modeling
Authors
Topics
Artificial Intelligence > Bayesian & Probabilistic > Probabilistic Modeling
Machine Learning > Learning Types > Unsupervised Learning
Natural Language Processing > Resources & Methods > Text Representation
Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling
Machine Learning > Core Methods > Dimensionality Reduction
Machine Learning > Core Methods > Probabilistic Modeling
Machine Learning > Learning Types > Representation Learning
Natural Language Processing > Resources & Methods > Language Modeling
Natural Language Processing > Applications > Topic Modeling