2021
EMNLP
EMNLP 2021
Multilingual Paraphrase Generation For Bootstrapping New Features in Task-Oriented Dialog Systems
Abstract
AbstractThe lack of labeled training data for new features is a common problem in rapidly changing real-world dialog systems. As a solution, we propose a multilingual paraphrase generation model that can be used to generate novel utterances for a target feature and target language. The generated utterances can be used to augment existing training data to improve intent classification and slot labeling models. We evaluate the quality of generated utterances using intrinsic evaluation metrics and by conducting downstream evaluation experiments with English as the source language and nine different target languages. Our method shows promise across languages, even in a zero-shot setting where no seed data is available.
🌉
Interdisciplinary Bridge
— Deep Learning and Machine Learning and Natural Language Processing
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Learning Types > Zero-Shot Learning
Natural Language Processing > Generation > Text Generation
Natural Language Processing > Resources & Methods > Multilingual NLP
Natural Language Processing > Applications > Dialogue Systems
Deep Learning > Learning Types > Transfer Learning
Deep Learning > Learning Types > Data Augmentation