Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs

Djordje Miladinovic; Kumar Shridhar; Kushal Jain; Max Paulus; Joachim M Buhmann; Carl Allen

2022 NIPS NeurIPS 2022

Learning to Drop Out: An Adversarial Approach to Training Sequence VAEs

Abstract

In principle, applying variational autoencoders (VAEs) to sequential data offers a method for controlled sequence generation, manipulation, and structured representation learning. However, training sequence VAEs is challenging: autoregressive decoders can often explain the data without utilizing the latent space, known as posterior collapse. To mitigate this, state-of-the-art models weaken' thepowerful decoder' by applying uniformly random dropout to the decoder input.We show theoretically that this removes pointwise mutual information provided by the decoder input, which is compensated for by utilizing the latent space. We then propose an adversarial training strategy to achieve information-based stochastic dropout. Compared to uniform dropout on standard text benchmark datasets, our targeted approach increases both sequence modeling performance and the information captured in the latent space.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Djordje Miladinovic , Kumar Shridhar , Kushal Jain , Max Paulus , Joachim M Buhmann , Carl Allen

Topics

Machine Learning > Learning Types > Adversarial Learning Deep Learning > Models > Variational Inference Machine Learning > Learning Types > Representation Learning Deep Learning > Learning Types > Adversarial Learning Artificial Intelligence > Bayesian & Probabilistic > Variational Inference Deep Learning > Learning Types > Variational Inference

Keywords

sequence generation adversarial training variational autoencoder latent space sequence model posterior collapse

Download PDF

Related papers

Transferring Pre-trained Multimodal Representations with Cross-modal Similarity Matching 2022

A Theoretical View on Sparsely Activated Networks 2022

Prune and distill: similar reformatting of image information along rat visual cortex and deep neural networks 2022

Matryoshka Representation Learning 2022

Off-Policy Evaluation with Deficient Support Using Side Information 2022