Neural Empirical Bayes

Saeed Saremi; Aapo Hyvärinen

2019 JMLR JMLR 2019

Neural Empirical Bayes

Abstract

We unify kernel density estimation and empirical Bayes and address a set of problems in unsupervised machine learning with a geometric interpretation of those methods, rooted in the concentration of measure phenomenon. Kernel density is viewed symbolically as $X\rightharpoonup Y$ where the random variable $X$ is smoothed to $Y= X+N(0,\sigma^2 I_d)$, and empirical Bayes is the machinery to denoise in a least-squares sense, which we express as $X \leftharpoondown Y$. A learning objective is derived by combining these two, symbolically captured by $X \rightleftharpoons Y$. Crucially, instead of using the original nonparametric estimators, we parametrize the energy function with a neural network denoted by $\phi$; at optimality, $\nabla \phi \approx -\nabla \log f$ where $f$ is the density of $Y$. The optimization problem is abstracted as interactions of high-dimensional spheres which emerge due to the concentration of isotropic Gaussians. We introduce two algorithmic frameworks based on this machinery: (i) a “walk-jump” sampling scheme that combines Langevin MCMC (walks) and empirical Bayes (jumps), and (ii) a probabilistic framework for associative memory, called NEBULA, defined a la Hopfield by the gradient flow of the learned energy to a set of attractors. We finish the paper by reporting the emergence of very rich “creative memories” as attractors of NEBULA for highly-overlapping spheres. [abs] [ pdf ][ bib ] © JMLR 2019. (edit, beta)

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

🧭 Keyword Pioneer — langevin mcmc

🐣 Hot Topic Early Bird — unsupervised learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Saeed Saremi , Aapo Hyvärinen

Topics

Artificial Intelligence > Bayesian & Probabilistic > Probabilistic Modeling Machine Learning > Learning Types > Unsupervised Learning Machine Learning > Learning Types > Representation Learning Machine Learning > Bayesian & Probabilistic > Bayesian Inference

Keywords

unsupervised learning kernel density estimation associative memory empirical baye energy-based model langevin mcmc neural network

Download PDF

Related papers

Adaptation Based on Generalized Discrepancy 2019

Iterated Learning in Dynamic Social Networks 2019

Pyro: Deep Universal Probabilistic Programming 2019

Matched Bipartite Block Model with Covariates 2019

Approximation Hardness for A Class of Sparse Optimization Problems 2019