Ranking Paragraphs for Improving Answer Recall in Open-Domain Question Answering

Jinhyuk Lee; Seongjun Yun; Hyunjae Kim; Miyoung Ko; Jaewoo Kang

2018 EMNLP EMNLP 2018

Ranking Paragraphs for Improving Answer Recall in Open-Domain Question Answering

Abstract

AbstractRecently, open-domain question answering (QA) has been combined with machine comprehension models to find answers in a large knowledge source. As open-domain QA requires retrieving relevant documents from text corpora to answer questions, its performance largely depends on the performance of document retrievers. However, since traditional information retrieval systems are not effective in obtaining documents with a high probability of containing answers, they lower the performance of QA systems. Simply extracting more documents increases the number of irrelevant documents, which also degrades the performance of QA systems. In this paper, we introduce Paragraph Ranker which ranks paragraphs of retrieved documents for a higher answer recall with less noise. We show that ranking paragraphs and aggregating answers using Paragraph Ranker improves performance of open-domain QA pipeline on the four open-domain QA datasets by 7.8% on average.

🌉 Interdisciplinary Bridge — Data Science & Analytics and Machine Learning and Natural Language Processing

📈 Trend Setter — Information Retrieval

🧭 Keyword Pioneer — paragraph ranking

🐣 Hot Topic Early Bird — document retrieval

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Jinhyuk Lee , Seongjun Yun , Hyunjae Kim , Miyoung Ko , Jaewoo Kang

Topics

Natural Language Processing > Applications > Information Retrieval Natural Language Processing > Applications > Question Answering Data Science & Analytics > Applications > Information Retrieval Machine Learning > Application Areas > Information Retrieval

Keywords

information retrieval document retrieval open-domain question answering paragraph ranking answer recall

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018