Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space

Chunyuan Li; Xiang Gao; Yuan Li; Baolin Peng; Xiujun Li; Yizhe Zhang; Jianfeng Gao

EMNLP 2020

Abstract

When trained effectively, the Variational Autoencoder (VAE) can be both a powerful generative model and an effective representation learning framework for natural language. In this paper, we propose the first large-scale language VAE model, Optimus (Organizing sentences via Pre-Trained Modeling of a Universal Space). A universal latent embedding space for sentences is first pre-trained on a large text corpus and then fine-tuned for various language generation and understanding tasks. Compared with GPT-2, Optimus enables guided language generation from an abstract level using the latent vectors. Compared with BERT, Optimus can generalize better on low-resource language understanding tasks due to the smooth latent space structure. Extensive experimental results on a wide range of language tasks demonstrate the effectiveness of Optimus. It achieves a new state of the art on VAE language modeling benchmarks.
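The abstract does not spell out the objective or how latent vectors guide generation, so the following is only a minimal sketch of the standard VAE ingredients it alludes to, assuming a diagonal Gaussian posterior and a standard normal prior. The function names (`kl_to_standard_normal`, `reparameterize`, `interpolate`) are hypothetical and are not taken from the paper or its code; the encoder and decoder networks are left out entirely.

```python
import math
import random


def kl_to_standard_normal(mu, logvar):
    """KL( N(mu, diag(exp(logvar))) || N(0, I) ) for a single latent vector.

    Assumption: diagonal Gaussian posterior, standard normal prior.
    """
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, logvar))


def reparameterize(mu, logvar, rng=random):
    """Draw z = mu + sigma * eps with eps ~ N(0, 1), the usual reparameterization."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0) for m, lv in zip(mu, logvar)]


def interpolate(z_a, z_b, steps):
    """Return `steps` evenly spaced latent vectors on the segment from z_a to z_b."""
    if steps < 2:
        raise ValueError("steps must be at least 2")
    points = []
    for i in range(steps):
        t = i / (steps - 1)
        points.append([(1.0 - t) * a + t * b for a, b in zip(z_a, z_b)])
    return points


if __name__ == "__main__":
    # Hypothetical posterior parameters for one sentence (illustrative values only).
    mu, logvar = [0.5, -1.0], [0.0, -0.5]
    print("KL term:", kl_to_standard_normal(mu, logvar))
    print("sampled z:", reparameterize(mu, logvar))
    # Intermediate latents that a decoder could turn into intermediate sentences.
    for z in interpolate([0.0, 0.0], [2.0, 4.0], 3):
        print(z)
```

In a smooth latent space, decoding each interpolated vector would be expected to yield sentences that shift gradually from one endpoint to the other, which is one way to read "guided language generation from an abstract level".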

Topics

Deep Learning > Architectures > Autoencoders
Deep Learning > Models > Generative Models
Deep Learning > Models > Variational Inference
Natural Language Processing > Generation > Language Modeling
Natural Language Processing > Generation > Text Generation

Keywords

representation learning, text generation, language modeling, variational autoencoder, language model, latent space, pre-trained language model, guided language generation
