2020 EMNLP EMNLP 2020

Querying Across Genres for Medical Claims in News

Abstract

AbstractWe present a query-based biomedical information retrieval task across two vastly different genres – newswire and research literature – where the goal is to find the research publication that supports the primary claim made in a health-related news article. For this task, we present a new dataset of 5,034 claims from news paired with research abstracts. Our approach consists of two steps: (i) selecting the most relevant candidates from a collection of 222k research abstracts, and (ii) re-ranking this list. We compare the classical IR approach using BM25 with more recent transformer-based models. Our results show that cross-genre medical IR is a viable task, but incorporating domain-specific knowledge is crucial.

🌉 Interdisciplinary Bridge — Computer Science and Data Science & Analytics and Healthcare & Medicine and Natural Language Processing
🧭 Keyword Pioneer — medical claim
🐝 Cross-Pollinator — Computer Science, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Machine Learning, Natural Language Processing