Variational Memory Encoder-Decoder

Hung Le; Truyen Tran; Thin Nguyen; Svetha Venkatesh

2018 NIPS NeurIPS 2018

Variational Memory Encoder-Decoder

Abstract

Introducing variability while maintaining coherence is a core task in learning to generate utterances in conversation. Standard neural encoder-decoder models and their extensions using conditional variational autoencoder often result in either trivial or digressive responses. To overcome this, we explore a novel approach that injects variability into neural encoder-decoder via the use of external memory as a mixture model, namely Variational Memory Encoder-Decoder (VMED). By associating each memory read with a mode in the latent mixture distribution at each timestep, our model can capture the variability observed in sequential data such as natural conversations. We empirically compare the proposed model against other recent approaches on various conversational datasets. The results show that VMED consistently achieves significant improvement over others in both metric-based and qualitative evaluations.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — conversational generation

🐣 Hot Topic Early Bird — dialogue generation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Hung Le , Truyen Tran , Thin Nguyen , Svetha Venkatesh

Topics

Artificial Intelligence > Core AI > Foundation Models Artificial Intelligence > Core AI > Memory Deep Learning > Architectures > Autoencoders Deep Learning > Models > Variational Inference Natural Language Processing > Generation > Dialogue Systems

Keywords

variational inference dialogue generation latent variable model mixture model latent variable variational autoencoder memory network external memory conversational system conversational generation

Download PDF

Related papers

Maximum Causal Tsallis Entropy Imitation Learning 2018

Recurrent World Models Facilitate Policy Evolution 2018

Bandit Learning in Concave N-Person Games 2018

Algorithmic Assurance: An Active Approach to Algorithmic Testing using Bayesian Optimisation 2018

PAC-Bayes bounds for stable algorithms with instance-dependent priors 2018