ARAML: A Stable Adversarial Training Framework for Text Generation

Pei Ke; Fei Huang; Minlie Huang; Xiaoyan Zhu

2019 EMNLP EMNLP 2019

ARAML: A Stable Adversarial Training Framework for Text Generation

Abstract

AbstractMost of the existing generative adversarial networks (GAN) for text generation suffer from the instability of reinforcement learning training algorithms such as policy gradient, leading to unstable performance. To tackle this problem, we propose a novel framework called Adversarial Reward Augmented Maximum Likelihood (ARAML). During adversarial training, the discriminator assigns rewards to samples which are acquired from a stationary distribution near the data rather than the generator’s distribution. The generator is optimized with maximum likelihood estimation augmented by the discriminator’s rewards instead of policy gradient. Experiments show that our model can outperform state-of-the-art text GANs with a more stable training process.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — reward augmentation

🐣 Hot Topic Early Bird — maximum likelihood

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Pei Ke , Fei Huang , Minlie Huang , Xiaoyan Zhu

Topics

Machine Learning > Learning Types > Adversarial Learning Machine Learning > Optimization & Theory > Optimization Reinforcement Learning > Methods > Policy Learning Deep Learning > Learning Types > Adversarial Learning Deep Learning > Learning Types > Generative Models Artificial Intelligence > Core AI > Natural Language Generation

Keywords

policy gradient text generation adversarial training maximum likelihood maximum likelihood estimation generative adversarial network training stability reward augmentation reward augmented

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

Iterative Dual Domain Adaptation for Neural Machine Translation 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019