Learning the Structure of Deep Sparse Graphical Models

Ryan P. Adams; Hanna Wallach; Zoubin Ghahramani

2010 AISTATS AISTATS 2010

Learning the Structure of Deep Sparse Graphical Models

Abstract

Deep belief networks are a powerful way to model complex probability distributions. However, it is difficult to learn the structure of a belief network, particularly one with hidden units. The Indian buffet process has been used as a nonparametric Bayesian prior on the structure of a directed belief network with a single infinitely wide hidden layer. Here, we introduce the cascading Indian buffet process (CIBP), which provides a prior on the structure of a layered, directed belief network that is unbounded in both depth and width, yet allows tractable inference. We use the CIBP prior with the nonlinear Gaussian belief network framework to allow each unit to vary its behavior between discrete and continuous representations. We use Markov chain Monte Carlo for inference in this model and explore the structures learned on image data.

🚀 Conference Pioneer — AISTATS 2010

🧭 Keyword Pioneer — gaussian belief network

🐣 Hot Topic Early Bird — markov chain monte carlo

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

🌉 Interdisciplinary Bridge — Computer Science and Machine Learning

Authors

Ryan P. Adams , Hanna Wallach , Zoubin Ghahramani

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Optimization & Theory > Bayesian Inference Machine Learning > Optimization & Theory > Stochastic Processes Computer Science > Foundations > Algorithms

Keywords

bayesian nonparametrics nonparametric bayesian markov chain monte carlo indian buffet process graphical model deep belief network gaussian belief network

Download PDF

Related papers

Towards Understanding Situated Natural Language 2010

Mass Fatality Incident Identification based on nuclear DNA evidence 2010

Locally Linear Denoising on Image Manifolds 2010

Negative Results for Active Learning with Convex Losses 2010

Collaborative Filtering on a Budget 2010