Multimodal Generation of Radiology Reports using Knowledge-Grounded Extraction of Entities and Relations

Francesco Dalla Serra; William Clackett; Hamish MacKinnon; Chaoyang Wang; Fani Deligianni; Jeff Dalton; Alison Q. O’Neil

AACL 2022

Abstract

Automated reporting has the potential to assist radiologists with the time-consuming procedure of generating text radiology reports. Most existing approaches generate the report directly from the radiology image; however, we observe that the resulting reports exhibit realistic style but lack clinical accuracy. Therefore, we propose a two-step pipeline that subdivides the problem into factual triple extraction followed by free-text report generation. The first step comprises supervised extraction of clinically relevant structured information from the image, expressed as triples of the form (entity1, relation, entity2). In the second step, these triples are used as input to condition the generation of the radiology report. In particular, we focus our work on Chest X-Ray (CXR) radiology report generation. The proposed framework shows state-of-the-art results on the MIMIC-CXR dataset according to most of the standard text generation metrics that we employ (BLEU, METEOR, ROUGE) and to clinical accuracy metrics (recall, precision and F1 assessed using the CheXpert labeler). It also gives a 23% reduction in the total number of errors and a 29% reduction in critical clinical errors, as assessed by expert human evaluation. In future, this solution can easily integrate more advanced model architectures, for both triple extraction and report generation, and can be applied to other complex image captioning tasks, such as those found in the medical domain.
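
As a rough illustration of the interface between the two steps, the sketch below represents extracted facts as (entity1, relation, entity2) triples and flattens them into a single string that a text generator could be conditioned on. It is not the authors' implementation: the `Triple` class, the `linearize` helper, the separator, and the example entities and relation names are all hypothetical, chosen only to show the data shape.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Triple:
    """One extracted fact of the form (entity1, relation, entity2)."""
    entity1: str
    relation: str
    entity2: str


def linearize(triples: List[Triple], sep: str = " | ") -> str:
    """Flatten triples into one string to condition a text generator on.

    The format (space-joined fields, ' | ' between triples) is an assumption
    made for this sketch, not a format taken from the paper.
    """
    return sep.join(f"{t.entity1} {t.relation} {t.entity2}" for t in triples)


# Hypothetical output of step 1 (triple extraction from the image).
# Entity and relation names here are invented for illustration.
extracted = [
    Triple("opacity", "located_at", "left lower lobe"),
    Triple("pleural effusion", "suggestive_of", "heart failure"),
]

# Step 2 would take this string as the conditioning input for report generation.
prompt = linearize(extracted)
```

Keeping the triples as an explicit intermediate representation means the extraction step can be inspected and evaluated separately from the fluency of the generated report.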

Topics

Artificial Intelligence > Core AI > Multimodal Learning; Machine Learning > Application Areas > Knowledge Distillation; Computer Vision > Domain-Specific > Medical Imaging; Natural Language Processing > Generation > Text Generation; Healthcare & Medicine > Clinical > Medical Imaging; Deep Learning > Learning Types > Knowledge Distillation

Keywords

medical imaging; relation extraction; text generation; image captioning; knowledge graph; chest x-ray; radiology report; entity extraction; knowledge-grounded generation; clinical accuracy
