SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference

Rowan Zellers; Yonatan Bisk; Roy Schwartz; Yejin Choi

2018 EMNLP EMNLP 2018

SWAG: A Large-Scale Adversarial Dataset for Grounded Commonsense Inference

Abstract

AbstractGiven a partial description like “she opened the hood of the car,” humans can reason about the situation and anticipate what might come next (”then, she examined the engine”). In this paper, we introduce the task of grounded commonsense inference, unifying natural language inference and commonsense reasoning. We present SWAG, a new dataset with 113k multiple choice questions about a rich spectrum of grounded situations. To address the recurring challenges of the annotation artifacts and human biases found in many existing datasets, we propose Adversarial Filtering (AF), a novel procedure that constructs a de-biased dataset by iteratively training an ensemble of stylistic classifiers, and using them to filter the data. To account for the aggressive adversarial filtering, we use state-of-the-art language models to massively oversample a diverse set of potential counterfactuals. Empirical results demonstrate that while humans can solve the resulting inference problems with high accuracy (88%), various competitive models struggle on our task. We provide comprehensive analysis that indicates significant opportunities for future research.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — adversarial filtering

🐣 Hot Topic Early Bird — commonsense reasoning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Rowan Zellers , Yonatan Bisk , Roy Schwartz , Yejin Choi

Topics

Machine Learning > Learning Types > Adversarial Learning Artificial Intelligence > Core AI > Reasoning Natural Language Processing > Applications > Natural Language Inference Natural Language Processing > Understanding > Natural Language Inference Deep Learning > Learning Types > Adversarial Learning Artificial Intelligence > Core AI > Natural Language Processing

Keywords

adversarial learning natural language inference language model multiple choice commonsense reasoning adversarial filtering grounded reasoning commonsense inference

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018