Rationally Reappraising ATIS-based Dialogue Systems

Jingcheng Niu; Gerald Penn

ACL 2019


Abstract

The Air Travel Information Service (ATIS) corpus has been the most common benchmark for evaluating Spoken Language Understanding (SLU) tasks for more than three decades since it was released. Recent state-of-the-art neural models have obtained F1-scores near 98% on the task of slot filling. We developed a rule-based grammar for the ATIS domain that achieves a 95.82% F1-score on our evaluation set. In the process, we furthermore discovered numerous shortcomings in the ATIS corpus annotation, which we have fixed. This paper presents a detailed account of these shortcomings, our proposed repairs, our rule-based grammar and the neural slot-filling architectures associated with ATIS. We also rationally reappraise the motivations for choosing a neural architecture in view of this account. Fixing the annotation errors results in a relative error reduction of between 19.4% and 52% across all architectures. We nevertheless argue that neural models must play a different role in ATIS dialogues because of the latter’s lack of variety.
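As a reading aid for the "relative error reduction" figure quoted above, the quantity can be sketched as follows. This is a minimal illustration that assumes error is measured as 1 - F1. The function name and the example scores are hypothetical and are not taken from the paper.

```python
def relative_error_reduction(f1_before: float, f1_after: float) -> float:
    """Return the fraction of the original error (1 - F1) that was removed.

    Assumption: error is defined as 1 - F1, with F1 given as a fraction in [0, 1).
    """
    error_before = 1.0 - f1_before
    error_after = 1.0 - f1_after
    if error_before <= 0.0:
        raise ValueError("f1_before must be below 1.0 for the ratio to be defined")
    return (error_before - error_after) / error_before


if __name__ == "__main__":
    # Hypothetical scores chosen only for illustration.
    before, after = 0.95, 0.96
    print(f"relative error reduction: {relative_error_reduction(before, after):.1%}")
```

Under this definition, a gain of a single F1 point near the top of the scale can correspond to a large relative reduction, which is why the abstract reports relative rather than absolute changes.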



Topics

Natural Language Processing > Understanding > Semantic Analysis
Natural Language Processing > Applications > Intent Classification
Natural Language Processing > Resources & Methods > Natural Language Inference
Natural Language Processing > Applications > Dialogue Systems
Artificial Intelligence > Core AI > Language
Natural Language Processing > Applications > Spoken Language Understanding

Keywords

intent classification, spoken language understanding, dialogue system, slot filling, rule-based grammar, annotation error, neural network

