Pushing the Limits of Radiology with Joint Modeling of Visual and Textual Information

Sonit Singh

ACL 2018

Abstract

Recently, there has been increasing interest in the intersection of computer vision and natural language processing. Researchers have studied several tasks in this area, including generating text descriptions from images and videos and learning language embeddings of images. More recent work has extended the scope to combining videos and language, learning to solve non-visual tasks using visual cues, visual question answering, and visual dialog. Despite the large body of research at the intersection of vision and language, its adaptation to the medical domain has not been fully explored. To address this research gap, we aim to develop machine learning models that can reason jointly over medical images and clinical text for advanced search, retrieval, annotation, and description of medical images.

Topics

Computer Vision > Domain-Specific > Medical Imaging
Natural Language Processing > Applications > Information Retrieval
Healthcare & Medicine > Clinical > Medical Imaging
Artificial Intelligence > Core AI > Multi-Modal Learning
Deep Learning > Models > Vision-Language Models

Keywords

medical imaging, information retrieval, multi-modal learning, vision-language model, visual-language model, clinical text, image description
