Sparsity-Inducing Categorical Prior Improves Robustness of the Information Bottleneck

Anirban Samaddar; Sandeep Madireddy; Prasanna Balaprakash; Taps Maiti; Gustavo de los Campos; Ian Fischer

AISTATS 2023

Abstract

The information bottleneck framework provides a systematic approach to learning representations that compress nuisance information in the input and extract semantically meaningful information about predictions. However, the choice of a prior distribution that fixes the dimensionality across all the data can restrict the flexibility of this approach for learning robust representations. We present a novel sparsity-inducing spike-slab categorical prior that uses sparsity as a mechanism to provide the flexibility that allows each data point to learn its own dimension distribution. In addition, it provides a mechanism for learning a joint distribution of the latent variable and the sparsity, and hence it can account for the complete uncertainty in the latent space. Through a series of experiments using in-distribution and out-of-distribution learning scenarios on the MNIST, CIFAR-10, and ImageNet data, we show that the proposed approach improves accuracy and robustness compared to traditional fixed-dimensional priors, as well as other sparsity induction mechanisms for latent variable models proposed in the literature.
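As a rough illustration of the idea in the abstract, the following minimal sketch draws a latent vector whose number of active coordinates is itself sampled from a per-input categorical distribution. It is one possible reading of the abstract, not the authors' implementation: the function name `sample_sparse_latent`, the Gaussian slab, the choice to keep the leading coordinates and zero the rest, and all numeric values are assumptions made for illustration.

```python
import random

def sample_sparse_latent(mean, std, dim_probs, rng):
    """Draw one latent vector with a randomly chosen number of active coordinates.

    mean, std : per-coordinate Gaussian "slab" parameters (length K), hypothetical
    dim_probs : categorical weights over active dimension counts 1..K, hypothetical
    """
    K = len(mean)
    # Categorical draw: how many leading coordinates stay active for this input.
    d = rng.choices(range(1, K + 1), weights=dim_probs, k=1)[0]
    # Slab: a Gaussian draw for every coordinate.
    slab = [rng.gauss(m, s) for m, s in zip(mean, std)]
    # Spike: coordinates at index d and beyond are set exactly to zero (assumed masking rule).
    z = [v if i < d else 0.0 for i, v in enumerate(slab)]
    return d, z

# Illustrative values only; in a trained model an encoder would output these per input.
rng = random.Random(0)
mean = [0.5, -1.0, 2.0, 0.3]
std = [1.0, 1.0, 1.0, 1.0]
dim_probs = [0.1, 0.6, 0.2, 0.1]
d, z = sample_sparse_latent(mean, std, dim_probs, rng)
```

Because `dim_probs` can differ for each input, each data point effectively carries its own distribution over latent dimensionality, and sampling `d` together with `z` reflects uncertainty in both the sparsity and the latent values.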

Topics

Machine Learning > Core Methods > Representation Learning
Deep Learning > Models > Generative Models
Deep Learning > Models > Variational Inference
Machine Learning > Core Methods > Probabilistic Modeling
Deep Learning > Learning Types > Representation Learning

Keywords

representation learning, information bottleneck, latent variable model, variational autoencoder, sparsity-inducing prior, out-of-distribution robustness
