Posing Fair Generalization Tasks for Natural Language Inference

Atticus Geiger; Ignacio Cases; Lauri Karttunen; Christopher Potts

2019 IJCNLP IJCNLP 2019

Posing Fair Generalization Tasks for Natural Language Inference

Abstract

AbstractDeep learning models for semantics are generally evaluated using naturalistic corpora. Adversarial testing methods, in which models are evaluated on new examples with known semantic properties, have begun to reveal that good performance at these naturalistic tasks can hide serious shortcomings. However, we should insist that these evaluations be fair – that the models are given data sufficient to support the requisite kinds of generalization. In this paper, we define and motivate a formal notion of fairness in this sense. We then apply these ideas to natural language inference by constructing very challenging but provably fair artificial datasets and showing that standard neural models fail to generalize in the required ways; only task-specific models that jointly compose the premise and hypothesis are able to achieve high performance, and even these models do not solve the task perfectly.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Atticus Geiger , Ignacio Cases , Lauri Karttunen , Christopher Potts

Topics

Artificial Intelligence > Core AI > Interpretability Artificial Intelligence > Learning Paradigms > Transfer Learning Machine Learning > Application Areas > Fairness Natural Language Processing > Applications > Natural Language Inference Artificial Intelligence > Core AI > Fairness Machine Learning > Learning Types > Fairness

Keywords

natural language inference adversarial testing neural model

Download PDF

Related papers

Fine-grained Knowledge Fusion for Sequence Labeling Domain Adaptation 2019

Exploiting Monolingual Data at Scale for Neural Machine Translation 2019

Distributionally Robust Language Modeling 2019

Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling 2019

ARAML: A Stable Adversarial Training Framework for Text Generation 2019