Towards Less Generic Responses in Neural Conversation Models: A Statistical Re-weighting Method

Yahui Liu; Wei Bi; Jun Gao; Xiaojiang Liu; Jian Yao; Shuming Shi

2018 EMNLP EMNLP 2018

Towards Less Generic Responses in Neural Conversation Models: A Statistical Re-weighting Method

Abstract

AbstractSequence-to-sequence neural generation models have achieved promising performance on short text conversation tasks. However, they tend to generate generic/dull responses, leading to unsatisfying dialogue experience. We observe that in the conversation tasks, each query could have multiple responses, which forms a 1-to-n or m-to-n relationship in the view of the total corpus. The objective function used in standard sequence-to-sequence models will be dominated by loss terms with generic patterns. Inspired by this observation, we introduce a statistical re-weighting method that assigns different weights for the multiple responses of the same query, and trains the common neural generation model with the weights. Experimental results on a large Chinese dialogue corpus show that our method improves the acceptance rate of generated responses compared with several baseline models and significantly reduces the number of generated generic responses.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — statistical re-weighting

🐣 Hot Topic Early Bird — conversational ai

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yahui Liu , Wei Bi , Jun Gao , Xiaojiang Liu , Jian Yao , Shuming Shi

Topics

Machine Learning > Core Methods > Representation Learning Natural Language Processing > Applications > Dialogue Systems

Keywords

conversational ai response generation text generation sequence-to-sequence model dialogue system statistical re-weighting

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018