Learning what to read: Focused machine reading

Enrique Noriega-Atala; Marco A. Valenzuela-Escárcega; Clayton Morrison; Mihai Surdeanu

EMNLP 2017

Abstract

Recent efforts in bioinformatics have achieved tremendous progress in the machine reading of biomedical literature, and the assembly of the extracted biochemical interactions into large-scale models such as protein signaling pathways. However, batch machine reading of literature at today’s scale (PubMed alone indexes over 1 million papers per year) is unfeasible due to both cost and processing overhead. In this work, we introduce a focused reading approach to guide the machine reading of biomedical literature towards what literature should be read to answer a biomedical query as efficiently as possible. We introduce a family of algorithms for focused reading, including an intuitive, strong baseline, and a second approach which uses a reinforcement learning (RL) framework that learns when to explore (widen the search) or exploit (narrow it). We demonstrate that the RL approach is capable of answering more queries than the baseline, while being more efficient, i.e., reading fewer documents.
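The explore/exploit choice described in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the five-document corpus, the function names (`component`, `retrieve`, `focused_read`), the reading of "explore" as an OR-style query and "exploit" as an AND-style query, and the fallback from exploit to explore. The two fixed rules at the end stand in for the policy that the paper learns with RL.

```python
from collections import deque
from typing import Callable, Dict, List, Set, Tuple

# Hypothetical toy corpus: document id -> interactions (undirected edges)
# that "reading" the document would extract. Invented for illustration.
CORPUS: Dict[str, List[Tuple[str, str]]] = {
    "doc1": [("A", "B")],
    "doc2": [("B", "C")],
    "doc3": [("C", "D")],
    "doc4": [("B", "X")],
    "doc5": [("X", "Y")],
}

Graph = Dict[str, Set[str]]


def component(graph: Graph, start: str) -> Set[str]:
    """Return all entities reachable from `start` (breadth-first search)."""
    seen = {start}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in graph.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def retrieve(src_side: Set[str], dst_side: Set[str], mode: str,
             unread: Set[str]) -> List[str]:
    """Assumed query semantics: 'explore' matches documents mentioning
    either side (wider search); 'exploit' requires both sides (narrower)."""
    hits = []
    for doc in sorted(unread):
        entities = {e for edge in CORPUS[doc] for e in edge}
        touches_src = bool(entities & src_side)
        touches_dst = bool(entities & dst_side)
        if mode == "exploit":
            matched = touches_src and touches_dst
        else:
            matched = touches_src or touches_dst
        if matched:
            hits.append(doc)
    return hits


def focused_read(src: str, dst: str,
                 choose_mode: Callable[[int, int], str],
                 max_steps: int = 10) -> Tuple[bool, List[str]]:
    """Read documents until src and dst are connected or nothing is left.
    `choose_mode(step, docs_read)` is a stand-in for a learned policy."""
    graph: Graph = {}
    unread = set(CORPUS)
    read_order: List[str] = []
    for step in range(max_steps):
        src_side = component(graph, src)
        if dst in src_side:
            return True, read_order
        dst_side = component(graph, dst)
        mode = choose_mode(step, len(read_order))
        hits = retrieve(src_side, dst_side, mode, unread)
        if not hits and mode == "exploit":
            # Assumed fallback: widen the search if the narrow query is empty.
            hits = retrieve(src_side, dst_side, "explore", unread)
        if not hits:
            return False, read_order
        for doc in hits:
            unread.discard(doc)
            read_order.append(doc)
            for u, v in CORPUS[doc]:
                graph.setdefault(u, set()).add(v)
                graph.setdefault(v, set()).add(u)
    return dst in component(graph, src), read_order


# Two fixed stand-in rules; the paper instead learns this choice with RL.
def always_explore(step: int, docs_read: int) -> str:
    return "explore"


def exploit_first(step: int, docs_read: int) -> str:
    return "exploit"
```

On this toy corpus, both rules connect `A` to `D`, but `exploit_first` reads three documents where `always_explore` reads four. That gap is the kind of efficiency difference the abstract refers to, though the sketch says nothing about the paper's actual results.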

Topics

Machine Learning > Learning Types > Active Learning; Natural Language Processing > Applications > Information Extraction; Reinforcement Learning > Methods > Deep RL; Healthcare & Medicine > Research > Bioinformatics; Machine Learning > Learning Types > Reinforcement Learning; Artificial Intelligence > Core AI > Reasoning; Artificial Intelligence > Core AI > Information Extraction

Keywords

deep reinforcement learning; reinforcement learning; information extraction; document retrieval; exploration-exploitation tradeoff; actor-critic method; biomedical literature; document selection; machine reading; focused reading; bioinformatics literature
