2020
EMNLP
EMNLP 2020
A Corpus for Outbreak Detection of Diseases Prevalent in Latin America
Abstract
AbstractIn this paper we present an annotated corpus which can be used for training and testing algorithms to automatically extract information about diseases outbreaks from news and health reports. We also propose initial approaches to extract information from it. The corpus has been constructed with two main tasks in mind. The first one, to extract entities about outbreaks such as disease, host, location among others. The second one, to retrieve relations among entities, for instance, in such geographic location fifteen cases of a given disease were reported. Overall, our goal is to offer resources and tools to perform an automated analysis so as to support early detection of disease outbreaks and therefore diminish their spreading.
🌉
Interdisciplinary Bridge
— Computer Science and Data Science & Analytics and Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— disease outbreak
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Core Methods > Representation Learning
Natural Language Processing > Applications > Information Extraction
Data Science & Analytics > Applications > Disease Surveillance
Computer Science > Applications > Document Analysis
Natural Language Processing > Applications > Named Entity Recognition