Expectation Maximization and Posterior Constraints

Kuzman Ganchev; Ben Taskar; João Gama

2007 NIPS NeurIPS 2007

Expectation Maximization and Posterior Constraints

Abstract

The expectation maximization (EM) algorithm is a widely used maximum likelihood estimation procedure for statistical models when the values of some of the variables in the model are not observed. Very often, however, our aim is primarily to find a model that assigns values to the latent variables that have intended meaning for our data and maximizing expected likelihood only sometimes accomplishes this. Unfortunately, it is typically difficult to add even simple a-priori information about latent variables in graphical models without making the models overly complex or intractable. In this paper, we present an efficient, principled way to inject rich constraints on the posteriors of latent variables into the EM algorithm. Our method can be used to learn tractable graphical models that satisfy additional, otherwise intractable constraints. Focusing on clustering and the alignment problem for statistical machine translation, we show that simple, intuitive posterior constraints can greatly improve the performance over standard baselines and be competitive with more complex, intractable models.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

📈 Trend Setter — Weakly Supervised Learning

🧭 Keyword Pioneer — posterior constraints

🐣 Hot Topic Early Bird — graphical model

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Kuzman Ganchev , Ben Taskar , João Gama

Topics

Artificial Intelligence > Bayesian & Probabilistic > Probabilistic Modeling Machine Learning > Core Methods > Clustering Machine Learning > Learning Types > Weakly Supervised Learning Machine Learning > Optimization & Theory > Statistical Learning Machine Learning > Core Methods > Probabilistic Modeling Machine Learning > Bayesian & Probabilistic > Bayesian Inference Machine Learning > Learning Types > Classification Machine Learning > Learning Types > Clustering

Keywords

clustering expectation maximization posterior constraints clustering algorithm graphical model latent variable

Download PDF

Related papers

Exponential Family Predictive Representations of State 2007

Privacy-Preserving Belief Propagation and Sampling 2007

Efficient Principled Learning of Thin Junction Trees 2007

How SVMs can estimate quantiles and the median 2007

Rapid Inference on a Novel AND/OR graph for Object Detection, Segmentation and Parsing 2007