Reinforcement Learning with Imbalanced Dataset for Data-to-Text Medical Report Generation

Toru Nishino; Ryota Ozaki; Yohei Momoki; Tomoki Taniguchi; Ryuji Kano; Norihisa Nakano; Yuki Tagawa; Motoki Taniguchi; Tomoko Ohkuma; Keigo Nakamura

EMNLP 2020

Abstract

Automated generation of medical reports that describe the findings in medical images helps radiologists by reducing their workload. A medical report generation system should generate correct and concise reports. However, data imbalance makes it difficult to train models accurately. Medical datasets are commonly imbalanced in their finding labels because incidence rates differ among diseases; moreover, the ratio of abnormal to normal findings is heavily skewed. We propose a novel reinforcement learning method with a reconstructor that improves the clinical correctness of generated reports when the data-to-text module is trained on a highly imbalanced dataset. We also introduce a novel data augmentation strategy for reinforcement learning that gives the model additional training on infrequent findings. For practical use, we employ a Two-Stage Medical Report Generator (TS-MRGen) for controllable report generation from input images. TS-MRGen consists of two separate stages: an image diagnosis module and a data-to-text module. Radiologists can modify the results of the image diagnosis module to control the reports that the data-to-text module generates. We conduct experiments with two medical datasets to assess both the data-to-text module and the entire two-stage model. The results demonstrate that the reports generated by our model describe the findings in the input image more correctly.
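The two-stage design described in the abstract can be illustrated with a minimal sketch. Everything here is hypothetical: `diagnose_image`, `findings_to_text`, `generate_report`, and the finding labels are placeholder names, and the template-based text stage merely stands in for the learned data-to-text module. The sketch shows only the control flow, in which a radiologist's corrections override the first stage's output before the report is generated; it is not the authors' implementation.

```python
from typing import Dict, Optional

# Maps a finding label to whether it is present (hypothetical representation).
Findings = Dict[str, bool]


def diagnose_image(image) -> Findings:
    """Stage 1 placeholder: stands in for an image diagnosis module.

    Returns fixed, made-up findings regardless of the input image.
    """
    return {"cardiomegaly": True, "pleural effusion": False}


def findings_to_text(findings: Findings) -> str:
    """Stage 2 placeholder: a template standing in for a learned data-to-text module."""
    positives = sorted(name for name, present in findings.items() if present)
    if not positives:
        return "No abnormal findings."
    return "Findings: " + ", ".join(positives) + "."


def generate_report(image, corrections: Optional[Findings] = None) -> str:
    """Run both stages, letting a reviewer override stage 1 results."""
    findings = diagnose_image(image)
    if corrections:
        # Reviewer-supplied labels take precedence over predicted ones.
        findings = {**findings, **corrections}
    return findings_to_text(findings)
```

Calling `generate_report(image)` uses the predicted findings as they are, while `generate_report(image, {"pleural effusion": True})` adds a finding the first stage missed. Keeping the intermediate findings explicit is what makes the generated report controllable in this sketch.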

Topics

Machine Learning > Application Areas > Data Augmentation

Machine Learning > Application Areas > Risk Management

Reinforcement Learning > Methods > Deep RL

Reinforcement Learning > Applications

Healthcare & Medicine > Clinical > Medical AI

Keywords

reinforcement learning; medical report generation; imbalanced learning; data-to-text generation; imbalanced dataset; clinical correctness; data augmentation for reinforcement learning; image diagnosis
