Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models

Yuanlong Shao; Stephan Gouws; Denny Britz; Anna Goldie; Brian Strope; Ray Kurzweil

EMNLP 2017

Abstract

Sequence-to-sequence models have been applied to the conversation response generation problem where the source sequence is the conversation history and the target sequence is the response. Unlike translation, conversation responding is inherently creative. The generation of long, informative, coherent, and diverse responses remains a hard task. In this work, we focus on the single turn setting. We add self-attention to the decoder to maintain coherence in longer responses, and we propose a practical approach, called the glimpse-model, for scaling to large datasets. We introduce a stochastic beam-search algorithm with segment-by-segment reranking which lets us inject diversity earlier in the generation process. We trained on a combined dataset of over 2.3B conversation messages mined from the web. In human evaluation studies, our method produces longer responses overall, with a higher proportion rated as acceptable and excellent as length increases, compared to baseline sequence-to-sequence models with explicit length-promotion. A back-off strategy produces better responses overall, in the full spectrum of lengths.
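The abstract names a stochastic beam search with segment-by-segment reranking but gives no algorithmic detail. The following is only a minimal sketch of that general idea, not the authors' implementation: each beam hypothesis is extended by sampling a short segment of tokens, and the pooled candidates are reranked and pruned after every segment rather than once at the end. The toy model `toy_next_token_logprobs`, the scorer `length_normalized_score`, and all parameter names and defaults are hypothetical and chosen purely for illustration.

```python
import math
import random
from typing import Callable, Dict, List, Sequence, Tuple

Hypothesis = Tuple[List[str], float]  # (tokens so far, cumulative log-probability)


def toy_next_token_logprobs(prefix: Sequence[str]) -> Dict[str, float]:
    # Hypothetical stand-in for a trained decoder: a fixed distribution
    # that ignores the prefix.
    return {"hello": math.log(0.5), "there": math.log(0.3), "</s>": math.log(0.2)}


def length_normalized_score(tokens: Sequence[str], logp: float) -> float:
    # Placeholder reranking score (assumption): average log-probability per token.
    return logp / len(tokens)


def stochastic_beam_search(
    next_token_logprobs: Callable[[Sequence[str]], Dict[str, float]],
    rerank_score: Callable[[Sequence[str], float], float],
    beam_size: int = 4,
    samples_per_hyp: int = 3,
    segment_len: int = 2,
    max_len: int = 6,
    eos: str = "</s>",
    seed: int = 0,
) -> List[Hypothesis]:
    rng = random.Random(seed)
    beam: List[Hypothesis] = [([], 0.0)]
    finished: List[Hypothesis] = []
    for start in range(0, max_len, segment_len):
        steps = min(segment_len, max_len - start)
        candidates: Dict[Tuple[str, ...], float] = {}
        for tokens, logp in beam:
            # Sample several continuations per hypothesis instead of taking
            # the top-k; this is where diversity enters.
            for _ in range(samples_per_hyp):
                seg_tokens, seg_logp = list(tokens), logp
                for _ in range(steps):
                    dist = next_token_logprobs(seg_tokens)
                    vocab = list(dist)
                    weights = [math.exp(dist[t]) for t in vocab]
                    tok = rng.choices(vocab, weights=weights, k=1)[0]
                    seg_tokens.append(tok)
                    seg_logp += dist[tok]
                    if tok == eos:
                        break
                candidates[tuple(seg_tokens)] = seg_logp  # drops duplicates
        # Rerank after every segment and keep only the best candidates.
        ranked = sorted(
            candidates.items(),
            key=lambda kv: rerank_score(kv[0], kv[1]),
            reverse=True,
        )
        beam = []
        for toks, lp in ranked[:beam_size]:
            if toks[-1] == eos:
                finished.append((list(toks), lp))
            else:
                beam.append((list(toks), lp))
        if not beam:
            break
    finished.extend(beam)  # hypotheses that reached max_len without eos
    return sorted(finished, key=lambda h: rerank_score(h[0], h[1]), reverse=True)


if __name__ == "__main__":
    for tokens, logp in stochastic_beam_search(
        toy_next_token_logprobs, length_normalized_score
    ):
        print(" ".join(tokens), round(logp, 3))
```

In this sketch, sampling within a segment supplies diversity and the per-segment rerank prunes weak candidates early. A real system would replace the toy distribution with a trained decoder and the placeholder score with whatever reranking criterion it actually uses.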

Topics

Machine Learning > Core Methods > Representation Learning; Machine Learning > Optimization & Theory > Neural Network Optimization; Deep Learning > Architectures > Recurrent Neural Networks; Artificial Intelligence > Core AI > Dialogue Systems

Keywords

natural language generation; response generation; beam search; sequence-to-sequence model; stochastic beam search; conversational response generation; conversation response; glimpse model
