COGS: A Compositional Generalization Challenge Based on Semantic Interpretation

Najoung Kim; Tal Linzen

2020 EMNLP EMNLP 2020

COGS: A Compositional Generalization Challenge Based on Semantic Interpretation

Abstract

AbstractNatural language is characterized by compositionality: the meaning of a complex expression is constructed from the meanings of its constituent parts. To facilitate the evaluation of the compositional abilities of language processing architectures, we introduce COGS, a semantic parsing dataset based on a fragment of English. The evaluation portion of COGS contains multiple systematic gaps that can only be addressed by compositional generalization; these include new combinations of familiar syntactic structures, or new combinations of familiar words and familiar structures. In experiments with Transformers and LSTMs, we found that in-distribution accuracy on the COGS test set was near-perfect (96–99%), but generalization accuracy was substantially lower (16–35%) and showed high sensitivity to random seed (+-6–8%). These findings indicate that contemporary standard NLP models are limited in their compositional generalization capacity, and position COGS as a good way to measure progress.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🐣 Hot Topic Early Bird — compositional generalization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Najoung Kim , Tal Linzen

Topics

Machine Learning > Optimization & Theory > Theory Natural Language Processing > Understanding > Semantic Analysis Natural Language Processing > Applications > Information Extraction Machine Learning > Learning Types > Evaluation Deep Learning > Models > Transformers Natural Language Processing > Applications > Semantic Parsing Artificial Intelligence > Core AI > Language Machine Learning > Learning Types > Generalization

Keywords

semantic parsing compositional generalization natural language long short-term memory evaluation benchmark neural network

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020