ReQA: An Evaluation for End-to-End Answer Retrieval Models

Amin Ahmad; Noah Constant; Yinfei Yang; Daniel Cer

EMNLP 2019

Abstract

Popular QA benchmarks like SQuAD have driven progress on the task of identifying answer spans within a specific passage, with models now surpassing human performance. However, retrieving relevant answers from a huge corpus of documents remains a challenging problem and places different requirements on the model architecture. There is growing interest in developing scalable answer retrieval models that are trained end-to-end and bypass the typical document retrieval step. In this paper, we introduce Retrieval Question-Answering (ReQA), a benchmark for evaluating large-scale sentence-level answer retrieval models. We establish baselines using both neural encoding models and classical information retrieval techniques. We release our evaluation code to encourage further work on this challenging task.
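To make the task setup concrete, the following is a minimal sketch of how sentence-level answer retrieval could be scored. It is not the released evaluation code. It assumes that questions and candidate answer sentences have already been encoded as vectors, that candidates are ranked by dot product, and that quality is summarized with recall@k; the function names, the toy vectors, and the choice of metric are all illustrative assumptions.

```python
from typing import List, Sequence, Set


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def rank_candidates(question_vec: Sequence[float],
                    candidate_vecs: Sequence[Sequence[float]]) -> List[int]:
    """Return candidate indices sorted from highest to lowest score."""
    scores = [dot(question_vec, c) for c in candidate_vecs]
    return sorted(range(len(candidate_vecs)), key=lambda i: scores[i], reverse=True)


def recall_at_k(rankings: Sequence[Sequence[int]],
                gold: Sequence[Set[int]],
                k: int) -> float:
    """Fraction of questions with at least one gold answer in the top k."""
    hits = sum(1 for ranked, answers in zip(rankings, gold)
               if answers & set(ranked[:k]))
    return hits / len(rankings)


# Toy data (hypothetical): three candidate sentences, two questions.
candidates = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
questions = [[0.9, 0.1], [0.1, 0.9]]
gold = [{0}, {2}]  # index of the correct candidate sentence per question

rankings = [rank_candidates(q, candidates) for q in questions]
print(recall_at_k(rankings, gold, k=1))
print(recall_at_k(rankings, gold, k=2))
```

Because every candidate sentence in the corpus is scored against each question, a setup like this evaluates retrieval directly, with no separate document retrieval step in between.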

Authors

Amin Ahmad, Noah Constant, Yinfei Yang, Daniel Cer

Topics

Natural Language Processing > Applications > Information Retrieval
Natural Language Processing > Applications > Question Answering

Keywords

neural encoding, question answering, information retrieval, document retrieval, evaluation benchmark, neural retrieval, answer retrieval, sentence-level retrieval
