Latent Retrieval for Weakly Supervised Open Domain Question Answering

Kenton Lee; Ming-Wei Chang; Kristina Toutanova

ACL 2019

Abstract

Recent work on open domain question answering (QA) assumes strong supervision of the supporting evidence and/or assumes a blackbox information retrieval (IR) system to retrieve evidence candidates. We argue that both are suboptimal, since gold evidence is not always available, and QA is fundamentally different from IR. We show for the first time that it is possible to jointly learn the retriever and reader from question-answer string pairs and without any IR system. In this setting, evidence retrieval from all of Wikipedia is treated as a latent variable. Since this is impractical to learn from scratch, we pre-train the retriever with an Inverse Cloze Task. We evaluate on open versions of five QA datasets. On datasets where the questioner already knows the answer, a traditional IR system such as BM25 is sufficient. On datasets where a user is genuinely seeking an answer, we show that learned retrieval is crucial, outperforming BM25 by up to 19 points in exact match.

Topics

Machine Learning > Learning Types > Weakly Supervised Learning
Natural Language Processing > Applications > Information Retrieval
Natural Language Processing > Applications > Question Answering
Natural Language Processing > Resources & Methods > Retrieval-Augmented Generation
Machine Learning > Core Methods > Retrieval
Machine Learning > Learning Paradigms > Weakly Supervised Learning
Machine Learning > Learning Types > Retrieval

Keywords

weakly supervised learning, information retrieval, weak supervision, evidence retrieval, open domain question answering, neural retriever, inverse cloze task, latent retrieval, inverse cloze

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019