Learning a Small Mixture of Trees

M. P. Kumar; Daphne Koller

2009 NIPS NeurIPS 2009

Learning a Small Mixture of Trees

Abstract

The problem of approximating a given probability distribution using a simpler distribution plays an important role in several areas of machine learning, e.g. variational inference and classification. Within this context, we consider the task of learning a mixture of tree distributions. Although mixtures of trees can be learned by minimizing the KL-divergence using an EM algorithm, its success depends heavily on the initialization. We propose an efficient strategy for obtaining a good initial set of trees that attempts to cover the entire observed distribution by minimizing the $\alpha$-divergence with $\alpha = \infty$. We formulate the problem using the fractional covering framework and present a convergent sequential algorithm that only relies on solving a convex program at each iteration. Compared to previous methods, our approach results in a significantly smaller mixture of trees that provides similar or better accuracies. We demonstrate the usefulness of our approach by learning pictorial structures for face recognition.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

📈 Trend Setter — Variational Inference

🧭 Keyword Pioneer — mixture of trees

🐣 Hot Topic Early Bird — variational inference

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

M. P. Kumar , Daphne Koller

Topics

Machine Learning > Core Methods > Clustering Deep Learning > Models > Variational Inference Computer Vision > Analysis > Face Recognition Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling Machine Learning > Core Methods > Optimization

Keywords

probabilistic modeling variational inference em algorithm face recognition mixture of trees graphical model mixture model pictorial structure tree distribution

Download PDF

Related papers

Solving Stochastic Games 2009

Bilinear classifiers for visual recognition 2009

Zero-shot Learning with Semantic Output Codes 2009

Matrix Completion from Power-Law Distributed Samples 2009

Heavy-Tailed Symmetric Stochastic Neighbor Embedding 2009