SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems

Harrison Lee; Raghav Gupta; Abhinav Rastogi; Yuan Cao; Bin Zhang; Yonghui Wu

2022 AAAI AAAI 2022

SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems

Abstract

Abstract Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling models to support any service in zero-shot through schemas, which describe service APIs to models in natural language. We explore the robustness of dialogue systems to linguistic variations in schemas by designing SGD-X - a benchmark extending SGD with semantically similar yet stylistically diverse variants for every schema. We observe that two top state tracking models fail to generalize well across schema variants, measured by joint goal accuracy and a novel metric for measuring schema sensitivity. Additionally, we present a simple model-agnostic data augmentation method to improve schema robustness.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Harrison Lee , Raghav Gupta , Abhinav Rastogi , Yuan Cao , Bin Zhang , Yonghui Wu

Topics

Machine Learning > Application Areas > Data Augmentation Machine Learning > Application Areas > Domain Generalization Natural Language Processing > Applications > Dialogue Systems Machine Learning > Learning Paradigms > Zero-Shot Learning Artificial Intelligence > Core AI > Dialogue Systems

Keywords

domain generalization data augmentation task-oriented dialogue dialogue state tracking zero-shot transfer schema-guided dialogue

Download PDF

Related papers

Dynamic Spatial Propagation Network for Depth Completion 2022

FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition 2022

Memory-Guided Semantic Learning Network for Temporal Sentence Grounding 2022

AnchorFace: Boosting TAR@FAR for Practical Face Recognition 2022

Parallel and High-Fidelity Text-to-Lip Generation 2022