Testing Cross-Database Semantic Parsers With Canonical Utterances

Heather Lent; Semih Yavuz; Tao Yu; Tong Niu; Yingbo Zhou; Dragomir Radev; Xi Victoria Lin

2021 EMNLP EMNLP 2021

Testing Cross-Database Semantic Parsers With Canonical Utterances

Abstract

AbstractThe benchmark performance of cross-database semantic parsing has climbed steadily in recent years, catalyzed by the wide adoption of pre-trained language models. Yet existing work have shown that state-of-the-art cross-database semantic parsers struggle to generalize to novel user utterances, databases and query structures. To obtain transparent details on the strengths and limitation of these models, we propose a diagnostic testing approach based on controlled synthesis of canonical natural language and SQL pairs. Inspired by the CheckList, we characterize a set of essential capabilities for cross-database semantic parsing models, and detailed the method for synthesizing the corresponding test data. We evaluated a variety of high performing models using the proposed approach, and identified several non-obvious weaknesses across models (e.g. unable to correctly select many columns). Our dataset and code are released as a test suite at http://github.com/hclent/BehaviorCheckingSemPar.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Evaluation

🧭 Keyword Pioneer — language model generalization

🐣 Hot Topic Early Bird — sql generation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Heather Lent , Semih Yavuz , Tao Yu , Tong Niu , Yingbo Zhou , Dragomir Radev , Xi Victoria Lin

Topics

Machine Learning > Learning Types > Weakly Supervised Learning Natural Language Processing > Understanding > Semantic Analysis Natural Language Processing > Applications > Machine Reading Comprehension Machine Learning > Learning Types > Evaluation Natural Language Processing > Applications > Semantic Parsing Artificial Intelligence > Core AI > Natural Language Processing Deep Learning > Learning Types > Evaluation

Keywords

semantic parsing sql generation language model language model generalization cross-database parsing sql synthesis canonical utterance diagnostic testing

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021