Efficient Low Rank Gaussian Variational Inference for Neural Networks

Marcin Tomczak; Siddharth Swaroop; Richard Turner

2020 NIPS NeurIPS 2020

Efficient Low Rank Gaussian Variational Inference for Neural Networks

Abstract

Bayesian neural networks are enjoying a renaissance driven in part by recent advances in variational inference (VI). The most common form of VI employs a fully factorized or mean-field distribution, but this is known to suffer from several pathologies, especially as we expect posterior distributions with highly correlated parameters. Current algorithms that capture these correlations with a Gaussian approximating family are difficult to scale to large models due to computational costs and high variance of gradient updates. By using a new form of the reparametrization trick, we derive a computationally efficient algorithm for performing VI with a Gaussian family with a low-rank plus diagonal covariance structure. We scale to deep feed-forward and convolutional architectures. We find that adding low-rank terms to parametrized diagonal covariance does not improve predictive performance except on small networks, but low-rank terms added to a constant diagonal covariance improves performance on small and large-scale network architectures.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🧭 Keyword Pioneer — gaussian variational inference

🐣 Hot Topic Early Bird — gaussian distribution

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Marcin Tomczak , Siddharth Swaroop , Richard Turner

Topics

Artificial Intelligence > Bayesian & Probabilistic > Bayesian Learning Deep Learning > Models > Variational Inference Deep Learning > Techniques > Model Architecture Deep Learning > Optimization & Theory > Optimization Machine Learning > Bayesian & Probabilistic > Variational Inference

Keywords

variational inference low-rank approximation gaussian distribution bayesian neural network gaussian variational inference reparameterization trick low-rank covariance

Download PDF

Related papers

Higher-Order Spectral Clustering of Directed Graphs 2020

Self-Supervised MultiModal Versatile Networks 2020

Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates 2020

Causal Intervention for Weakly-Supervised Semantic Segmentation 2020

Taming Discrete Integration via the Boon of Dimensionality 2020