Improving Compositional Generalization in Semantic Parsing

Inbar Oren; Jonathan Herzig; Nitish Gupta; Matt Gardner; Jonathan Berant

EMNLP 2020

Abstract

Generalization of models to out-of-distribution (OOD) data has captured tremendous attention recently. Specifically, compositional generalization, i.e., whether a model generalizes to new structures built of components observed during training, has sparked substantial interest. In this work, we investigate compositional generalization in semantic parsing, a natural test-bed for compositional generalization, as output programs are constructed from sub-components. We analyze a wide variety of models and propose multiple extensions to the attention module of the semantic parser, aiming to improve compositional generalization. We find that the following factors improve compositional generalization: (a) using contextual representations, such as ELMo and BERT, (b) informing the decoder which input tokens have previously been attended to, (c) training the decoder attention to agree with pre-computed token alignments, and (d) downsampling examples corresponding to frequent program templates. While we substantially reduce the gap between in-distribution and OOD generalization, performance on OOD compositions is still substantially lower.
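Factor (d) above, downsampling examples that correspond to frequent program templates, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `to_template` and `downsample_by_template`, the regex-based masking of quoted strings and numbers, and the fixed per-template cap `max_per_template` are all assumptions made for illustration.

```python
import random
import re
from collections import defaultdict


def to_template(program: str) -> str:
    """Hypothetical anonymizer: mask quoted strings and numeric literals.

    Assumption: a program template is the program with its constants
    replaced by placeholders. The abstract does not specify the procedure.
    """
    masked = re.sub(r"'[^']*'|\"[^\"]*\"", "<STR>", program)
    return re.sub(r"\b\d+(?:\.\d+)?\b", "<NUM>", masked)


def downsample_by_template(examples, max_per_template, seed=0):
    """Keep at most `max_per_template` (question, program) pairs per template."""
    rng = random.Random(seed)

    # Group examples by the template of their program.
    groups = defaultdict(list)
    for question, program in examples:
        groups[to_template(program)].append((question, program))

    # Randomly subsample only the groups that exceed the cap.
    kept = []
    for group in groups.values():
        if len(group) > max_per_template:
            group = rng.sample(group, max_per_template)
        kept.extend(group)
    return kept
```

With a cap of 2, a training set containing five examples of one template and one example of another would shrink to three examples, so rare templates make up a larger share of what the model sees.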

Topics

Natural Language Processing > Understanding > Parsing
Natural Language Processing > Understanding > Semantic Analysis

Keywords

attention mechanism; semantic parsing; out-of-distribution generalization; contextual representation; compositional generalization; program template
