Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models

Dinghan Shen; Asli Celikyilmaz; Yizhe Zhang; Liqun Chen; Xin Wang; Jianfeng Gao; Lawrence Carin

2019 ACL ACL 2019

Towards Generating Long and Coherent Text with Multi-Level Latent Variable Models

Abstract

AbstractVariational autoencoders (VAEs) have received much attention recently as an end-to-end architecture for text generation with latent variables. However, previous works typically focus on synthesizing relatively short sentences (up to 20 words), and the posterior collapse issue has been widely identified in text-VAEs. In this paper, we propose to leverage several multi-level structures to learn a VAE model for generating long, and coherent text. In particular, a hierarchy of stochastic layers between the encoder and decoder networks is employed to abstract more informative and semantic-rich latent codes. Besides, we utilize a multi-level decoder structure to capture the coherent long-term structure inherent in long-form texts, by generating intermediate sentence representations as high-level plan vectors. Extensive experimental results demonstrate that the proposed multi-level VAE model produces more coherent and less repetitive long text compared to baselines as well as can mitigate the posterior-collapse issue.

🌉 Interdisciplinary Bridge — Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — long text generation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Dinghan Shen , Asli Celikyilmaz , Yizhe Zhang , Liqun Chen , Xin Wang , Jianfeng Gao , Lawrence Carin

Topics

Deep Learning > Models > Generative Models Deep Learning > Models > Variational Inference Natural Language Processing > Generation > Text Generation

Keywords

latent variable variational autoencoder posterior collapse long text generation coherent text multi-level latent

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019