Conditional Response Augmentation for Dialogue Using Knowledge Distillation

Myeongho Jeong; Seungtaek Choi; Hojae Han; Kyungho Kim; Seung-won Hwang

2020 INTERSPEECH INTERSPEECH 2020

Conditional Response Augmentation for Dialogue Using Knowledge Distillation

Abstract

This paper studies dialogue response selection task. As state-of-the-arts are neural models requiring a large training set, data augmentation is essential to overcome the sparsity of observational annotation, where one observed response is annotated as gold. In this paper, we propose counterfactual augmentation, of considering whether unobserved utterances would “counterfactually” replace the labelled response, for the given context, and augment only if that is the case. We empirically show that our pipeline improves BERT-based models in two different response selection tasks without incurring annotation overheads.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — counterfactual augmentation

🐣 Hot Topic Early Bird — dialogue system

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Myeongho Jeong , Seungtaek Choi , Hojae Han , Kyungho Kim , Seung-won Hwang

Topics

Machine Learning > Application Areas > Data Augmentation Machine Learning > Application Areas > Knowledge Distillation Natural Language Processing > Generation > Dialogue Systems Natural Language Processing > Applications > Dialogue Systems Machine Learning > Learning Types > Data Augmentation Deep Learning > Techniques > Knowledge Distillation

Keywords

knowledge distillation data augmentation counterfactual augmentation dialogue system response selection bert-based model

Download PDF

Related papers

Memory Controlled Sequential Self Attention for Sound Recognition 2020

Dual Attention in Time and Frequency Domain for Voice Activity Detection 2020

Automatic Prediction of Speech Intelligibility Based on X-Vectors in the Context of Head and Neck Cancer 2020

A Noise Robust Technique for Detecting Vowels in Speech Signals 2020

Joint Detection of Sentence Stress and Phrase Boundary for Prosody 2020