Incorporating Figure Captions and Descriptive Text in MeSH Term Indexing

Xindi Wang; Robert E. Mercer

ACL 2019

Abstract

The goal of text classification is to automatically assign categories to documents. Deep learning automatically learns effective features from data instead of adopting human-designed features. In this paper, we focus specifically on biomedical document classification using a deep learning approach. We present a novel multichannel TextCNN model for MeSH term indexing. Beyond the normal use of the text from the abstract and title for model training, we also consider figure and table captions, as well as paragraphs associated with the figures and tables. We demonstrate that these latter text sources are important feature sources for our method. A new dataset consisting of these text segments curated from 257,590 full text articles together with the articles’ MEDLINE/PubMed MeSH terms is publicly available.

Topics

Deep Learning > Architectures > Neural Networks; Natural Language Processing > Applications > Information Extraction; Natural Language Processing > Applications > Text Classification; Healthcare & Medicine > Clinical > Medical Imaging; Deep Learning > Learning Types > Deep Learning

Keywords

text classification; deep learning; document classification; biomedical text mining; multi-channel model; MeSH term indexing; multichannel convolutional neural network; MeSH indexing
