WIQA: A dataset for “What if...” reasoning over procedural text

Niket Tandon; Bhavana Dalvi; Keisuke Sakaguchi; Peter Clark; Antoine Bosselut

2019 EMNLP EMNLP 2019

WIQA: A dataset for “What if...” reasoning over procedural text

Abstract

AbstractWe introduce WIQA, the first large-scale dataset of “What if...” questions over procedural text. WIQA contains a collection of paragraphs, each annotated with multiple influence graphs describing how one change affects another, and a large (40k) collection of “What if...?” multiple-choice questions derived from these. For example, given a paragraph about beach erosion, would stormy weather hasten or decelerate erosion? WIQA contains three kinds of questions: perturbations to steps mentioned in the paragraph; external (out-of-paragraph) perturbations requiring commonsense knowledge; and irrelevant (no effect) perturbations. We find that state-of-the-art models achieve 73.8% accuracy, well below the human performance of 96.3%. We analyze the challenges, in particular tracking chains of influences, and present the dataset as an open challenge to the community.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Healthcare & Medicine and Interdisciplinary and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Niket Tandon , Bhavana Dalvi , Keisuke Sakaguchi , Peter Clark , Antoine Bosselut

Topics

Natural Language Processing > Applications > Question Answering Healthcare & Medicine > Clinical > Clinical NLP Interdisciplinary > Cognitive Science > Cognitive Modeling Artificial Intelligence > Core AI > Reasoning Natural Language Processing > Applications > Natural Language Inference

Keywords

question answering multiple choice commonsense reasoning multiple-choice question procedural text influence graph

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

Iterative Dual Domain Adaptation for Neural Machine Translation 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019