2012
AISTATS
AISTATS 2012
On Estimation and Selection for Topic Models
Abstract
This article describes posterior maximization for topic models, identifying computational and conceptual gains from inference under a non-standard parametrization. We then show that fitted parameters can be used as the basis for a novel approach to marginal likelihood estimation, via block-diagonal approximation to the information matrix, that facilitates choosing the number of latent topics. This likelihood-based model selection is complemented with a goodness-of-fit analysis built around estimated residual dispersion. Examples are provided to illustrate model selection as well as to compare our estimation against standard alternative techniques.
🧭
Keyword Pioneer
— latent topics
🐣
Hot Topic Early Bird
— model selection
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Optimization & Theory > Learning Theory
Machine Learning > Optimization & Theory > Statistical Learning
Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling
Machine Learning > Bayesian & Probabilistic > Bayesian Inference
Machine Learning > Core Methods > Model Selection
Machine Learning > Learning Types > Model Selection