Getting Closer to AI Complete Question Answering: A Set of Prerequisite Real Tasks

Anna Rogers; Olga Kovaleva; Matthew Downey; Anna Rumshisky

2020 AAAI AAAI 2020

Getting Closer to AI Complete Question Answering: A Set of Prerequisite Real Tasks

Abstract

Abstract The recent explosion in question answering research produced a wealth of both factoid reading comprehension (RC) and commonsense reasoning datasets. Combining them presents a different kind of task: deciding not simply whether information is present in the text, but also whether a confident guess could be made for the missing information. We present QuAIL, the first RC dataset to combine text-based, world knowledge and unanswerable questions, and to provide question type annotation that would enable diagnostics of the reasoning strategies by a given QA system. QuAIL contains 15K multi-choice questions for 800 texts in 4 domains. Crucially, it offers both general and text-specific questions, unlikely to be found in pretraining data. We show that QuAIL poses substantial challenges to the current state-of-the-art systems, with a 30% drop in accuracy compared to the most similar existing dataset.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — diagnostic evaluation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Anna Rogers , Olga Kovaleva , Matthew Downey , Anna Rumshisky

Topics

Machine Learning > Learning Types > Self-Supervised Learning Natural Language Processing > Applications > Question Answering Machine Learning > Core Methods > Evaluation Deep Learning > Learning Types > Multi-Domain Learning

Keywords

question answering reading comprehension commonsense reasoning unanswerable question world knowledge diagnostic evaluation

Download PDF

Related papers

Enhancing Pointer Network for Sentence Ordering with Pairwise Ordering Predictions 2020

CopyMTL: Copy Mechanism for Joint Extraction of Entities and Relations with Multi-Task Learning 2020

Neural Simile Recognition with Cyclic Multitask Learning and Local Attention 2020

Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy 2020

Multi-Point Semantic Representation for Intent Classification 2020