Query and Output: Generating Words by Querying Distributed Word Representations for Paraphrase Generation

Shuming Ma; Xu Sun; Wei Li; Sujian Li; Wenjie Li; Xuancheng Ren

NAACL 2018

Abstract

Most recent approaches use the sequence-to-sequence model for paraphrase generation. The existing sequence-to-sequence model tends to memorize the words and patterns in the training dataset instead of learning the meaning of the words. As a result, the generated sentences are often grammatically correct but semantically improper. In this work, we introduce a novel model based on the encoder-decoder framework, called Word Embedding Attention Network (WEAN). Our proposed model generates words by querying distributed word representations (i.e., neural word embeddings), with the aim of capturing the meaning of the corresponding words. Following previous work, we evaluate our model on two paraphrase-oriented tasks, namely text simplification and short text abstractive summarization. Experimental results show that our model outperforms the sequence-to-sequence baseline by 6.3 and 5.5 BLEU points on two English text simplification datasets, and by 5.7 ROUGE-2 F1 points on a Chinese summarization dataset. Moreover, our model achieves state-of-the-art performance on these three benchmark datasets.
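
The idea of generating a word by querying word embeddings can be illustrated with a minimal sketch. The abstract does not specify the scoring function, so the dot product used here is an assumption, and the names `query_embeddings`, `EMBEDDINGS`, and the toy three-dimensional vectors are hypothetical. The sketch scores each vocabulary word by how well its embedding matches a query vector (standing in for a decoder state) and normalizes the scores into a distribution.

```python
import math


def softmax(scores):
    """Turn raw scores into probabilities (shifted by the max for numerical stability)."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def query_embeddings(query, embeddings):
    """Score every word by the dot product of the query with its embedding.

    The dot product is an assumed scoring function, chosen only for illustration.
    Returns a dict mapping each word to its probability.
    """
    words = list(embeddings)
    scores = [
        sum(q * e for q, e in zip(query, embeddings[word]))
        for word in words
    ]
    return dict(zip(words, softmax(scores)))


# Hypothetical toy embeddings: "big" and "large" are close, "cat" is far away.
EMBEDDINGS = {
    "big": [0.9, 0.1, 0.0],
    "large": [0.8, 0.2, 0.1],
    "cat": [0.0, 0.1, 0.9],
}

# Hypothetical query vector standing in for a decoder state.
query = [1.0, 0.0, 0.0]

distribution = query_embeddings(query, EMBEDDINGS)
best_word = max(distribution, key=distribution.get)
```

In this toy setup, words with similar embeddings receive similar probabilities, which is the intuition behind scoring candidates by their representations rather than by an unrelated output layer.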

Topics

Deep Learning > Architectures > Neural Networks

Natural Language Processing > Generation > Text Generation

Machine Learning > Learning Types > Representation Learning

Keywords

paraphrase generation; text simplification; sequence-to-sequence model; encoder-decoder framework; distributed word representation; word embedding attention

Related papers

A Melody-Conditioned Lyrics Language Model 2018

Before Name-Calling: Dynamics and Triggers of Ad Hominem Fallacies in Web Argumentation 2018

Automated Essay Scoring in the Presence of Biased Ratings 2018

Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input 2018

QuickEdit: Editing Text & Translations by Crossing Words Out 2018