MRC Examples Answerable by BERT without a Question Are Less Effective in MRC Model Training

Hongyu Li; Tengyang Chen; Shuting Bai; Takehito Utsuro; Yasuhide Kawada

AACL 2020

Abstract

Models developed for Machine Reading Comprehension (MRC) are asked to predict an answer from a question and its related context. However, some examples can be answered correctly by a BERT-based MRC model that is given only the context, without the question. In this paper, such examples are referred to as “easy to answer”, while the others, i.e., those that a BERT-based MRC model cannot answer without the question, are referred to as “hard to answer”. Based on classifying examples as answerable or unanswerable by BERT without the question, we propose a BERT-based method that splits the training examples of the MRC dataset SQuAD1.1 into “easy to answer” and “hard to answer” subsets. An experimental comparison of two models, one trained only on “easy to answer” examples and the other on “hard to answer” examples, demonstrates that the latter outperforms the former.
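The split described in the abstract can be illustrated with a minimal sketch. It assumes a question-blind predictor is already available as a function from context to answer string; here a toy heuristic, `first_capitalized_span`, stands in for the BERT model trained without questions. The names `split_by_context_only`, `normalize`, and the example data are hypothetical, and the exact-match criterion with SQuAD-style normalization is an assumption, not necessarily the criterion used in the paper.

```python
import re
import string
from typing import Callable, Dict, List, Tuple

Example = Dict[str, str]  # keys: "context", "question", "answer"


def normalize(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in string.punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def split_by_context_only(
    examples: List[Example],
    predict_from_context: Callable[[str], str],
) -> Tuple[List[Example], List[Example]]:
    """Partition examples by whether a question-blind predictor gets them right.

    The predictor sees only the context. An exact match after normalization
    (an assumed criterion) marks the example as "easy to answer".
    """
    easy: List[Example] = []
    hard: List[Example] = []
    for ex in examples:
        predicted = predict_from_context(ex["context"])
        if normalize(predicted) == normalize(ex["answer"]):
            easy.append(ex)
        else:
            hard.append(ex)
    return easy, hard


def first_capitalized_span(context: str) -> str:
    """Toy stand-in for a question-blind model: first run of capitalized words."""
    match = re.search(r"[A-Z][a-z]+(?: [A-Z][a-z]+)*", context)
    return match.group(0) if match else ""


# Hypothetical examples, not taken from SQuAD1.1.
EXAMPLES: List[Example] = [
    {
        "context": "Paris is the capital of France.",
        "question": "What is the capital of France?",
        "answer": "Paris",
    },
    {
        "context": "The river flows through Vienna and Budapest.",
        "question": "Which city does the river reach after Vienna?",
        "answer": "Budapest",
    },
]

easy, hard = split_by_context_only(EXAMPLES, first_capitalized_span)
print(len(easy), "easy to answer;", len(hard), "hard to answer")
```

In a real setting, `predict_from_context` would wrap a BERT span-extraction model fine-tuned on contexts alone, and the two resulting subsets would each be used to train a separate, ordinary MRC model for comparison.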

Topics

Deep Learning > Architectures > Transformers
Natural Language Processing > Applications > Machine Reading Comprehension

Keywords

text classification, question answering, machine reading comprehension
