A recursive estimate for the predictive likelihood in a topic model

James Scott; Jason Baldridge

2013 AISTATS AISTATS 2013

A recursive estimate for the predictive likelihood in a topic model

Abstract

We consider the problem of evaluating the predictive log likelihood of a previously un- seen document under a topic model. This task arises when cross-validating for a model hyperparameter, when testing a model on a hold-out set, and when comparing the performance of different fitting strategies. Yet it is known to be very challenging, as it is equivalent to estimating a marginal likelihood in Bayesian model selection. We propose a fast algorithm for approximating this likelihood, one whose computational cost is linear both in document length and in the number of topics. The method is a first-order approximation to the algorithm of Carvalho et al. (2010), and can also be interpreted as a one-particle, Rao-Blackwellized version of the "left-to-right" method of Wallach et al. (2009). On our test examples, the proposed method gives similar answers to these other methods, but at lower computational cost.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

📈 Trend Setter — Language Modeling

🧭 Keyword Pioneer — predictive likelihood

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

James Scott , Jason Baldridge

Topics

Artificial Intelligence > Bayesian & Probabilistic > Bayesian Learning Artificial Intelligence > Bayesian & Probabilistic > Probabilistic Modeling Natural Language Processing > Resources & Methods > Language Modeling Machine Learning > Bayesian & Probabilistic > Bayesian Inference

Keywords

bayesian inference marginal likelihood bayesian model selection topic model predictive likelihood rao-blackwellized estimation

Download PDF

Related papers

Consensus Ranking with Signed Permutations 2013

Ultrahigh Dimensional Feature Screening via RKHS Embeddings 2013

Collapsed Variational Bayesian Inference for Hidden Markov Models 2013

Learning Social Infectivity in Sparse Low-rank Networks Using Multi-dimensional Hawkes Processes 2013

Evidence Estimation for Bayesian Partially Observed MRFs 2013