Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study

Chinnadhurai Sankar; Sandeep Subramanian; Chris Pal; Sarath Chandar; Yoshua Bengio

2019 ACL ACL 2019

Do Neural Dialog Systems Use the Conversation History Effectively? An Empirical Study

Abstract

AbstractNeural generative models have been become increasingly popular when building conversational agents. They offer flexibility, can be easily adapted to new domains, and require minimal domain engineering. A common criticism of these systems is that they seldom understand or use the available dialog history effectively. In this paper, we take an empirical approach to understanding how these models use the available dialog history by studying the sensitivity of the models to artificially introduced unnatural changes or perturbations to their context at test time. We experiment with 10 different types of perturbations on 4 multi-turn dialog datasets and find that commonly used neural dialog architectures like recurrent and transformer-based seq2seq models are rarely sensitive to most perturbations such as missing or reordering utterances, shuffling words, etc. Also, by open-sourcing our code, we believe that it will serve as a useful diagnostic tool for evaluating dialog systems in the future.

❓ The Questioner

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — dialogue history

🐣 Hot Topic Early Bird — conversational agent

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Chinnadhurai Sankar , Sandeep Subramanian , Chris Pal , Sarath Chandar , Yoshua Bengio

Topics

Artificial Intelligence > Core AI > Agent Systems Deep Learning > Architectures > Neural Networks Natural Language Processing > Applications > Dialogue Systems Machine Learning > Optimization & Theory > Evaluation Machine Learning > Learning Types > Evaluation Deep Learning > Optimization & Theory > Evaluation Artificial Intelligence > Core AI > Dialogue Systems

Keywords

empirical study perturbation analysis empirical evaluation conversational agent dialogue history sequence-to-sequence model dialogue system seq2seq model context sensitivity empirical analysis conversation history neural network multi-turn dialog neural dialog system

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019