Generalizing Question Answering System with Pre-trained Language Model Fine-tuning

Dan Su; Yan Xu; Genta Indra Winata; Peng Xu; Hyeondey Kim; Zihan Liu; Pascale Fung

EMNLP 2019

Abstract

With a large number of datasets being released and new techniques being proposed, question answering (QA) systems have witnessed great breakthroughs in reading comprehension (RC) tasks. However, most existing methods focus on improving in-domain performance, leaving open the research question of how these models and techniques can generalize to out-of-domain and unseen RC tasks. To enhance the generalization ability, we propose a multi-task learning framework that learns a shared representation across different tasks. Our model is built on top of a large pre-trained language model, such as XLNet, and then fine-tuned on multiple RC datasets. Experimental results show the effectiveness of our methods, with an average Exact Match score of 56.59 and an average F1 score of 68.98, which significantly improve over the BERT-Large baseline by 8.39 and 7.22, respectively.
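The abstract reports results as average Exact Match (EM) and F1 scores. As a minimal sketch of how such scores are commonly computed for extractive RC, the snippet below follows the widely used SQuAD-style convention (lowercasing, stripping punctuation and English articles, then comparing tokens). This convention is an assumption here; the abstract does not state which scoring script the authors used, and the function names `normalize_answer`, `exact_match`, `f1_score`, and `average_scores` are illustrative only.

```python
import re
import string
from collections import Counter

_PUNCT = set(string.punctuation)


def normalize_answer(text: str) -> str:
    """Lowercase, drop punctuation and English articles, collapse whitespace."""
    text = text.lower()
    text = "".join(ch for ch in text if ch not in _PUNCT)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, gold: str) -> float:
    """1.0 if the normalized strings are identical, else 0.0."""
    return float(normalize_answer(prediction) == normalize_answer(gold))


def f1_score(prediction: str, gold: str) -> float:
    """Token-level F1 between the normalized prediction and gold answer."""
    pred_tokens = normalize_answer(prediction).split()
    gold_tokens = normalize_answer(gold).split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; only one empty counts as a miss.
        return float(pred_tokens == gold_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(gold_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)


def average_scores(pairs):
    """Return (EM, F1) as percentages averaged over (prediction, gold) pairs."""
    em = sum(exact_match(p, g) for p, g in pairs) / len(pairs)
    f1 = sum(f1_score(p, g) for p, g in pairs) / len(pairs)
    return 100.0 * em, 100.0 * f1


if __name__ == "__main__":
    # Toy pairs invented for illustration; not data from the paper.
    toy = [("The Eiffel Tower", "eiffel tower"), ("in Paris France", "Paris")]
    print(average_scores(toy))
```

Under this convention, a prediction that contains the gold answer plus extra words earns partial F1 credit but no EM credit, which is why F1 averages are typically higher than EM averages, as in the figures reported above.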

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning
Machine Learning > Application Areas > Domain Generalization
Natural Language Processing > Applications > Question Answering
Artificial Intelligence > Learning Paradigms > Multi-Task Learning
Artificial Intelligence > Core AI > Natural Language Processing
Deep Learning > Learning Types > Fine-Tuning

Keywords

multi-task learning, domain generalization, transfer learning, question answering, reading comprehension, machine reading comprehension, model generalization, pre-trained language model
