2021
EMNLP
EMNLP 2021
Progressive Transformer-Based Generation of Radiology Reports
Abstract
AbstractInspired by Curriculum Learning, we propose a consecutive (i.e., image-to-text-to-text) generation framework where we divide the problem of radiology report generation into two steps. Contrary to generating the full radiology report from the image at once, the model generates global concepts from the image in the first step and then reforms them into finer and coherent texts using transformer-based architecture. We follow the transformer-based sequence-to-sequence paradigm at each step. We improve upon the state-of-the-art on two benchmark datasets.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Computer Vision and Deep Learning and Healthcare & Medicine
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Multimodal Learning
Artificial Intelligence > Core AI > Procedural Generation
Deep Learning > Architectures > Transformers
Computer Vision > Domain-Specific > Medical Imaging
Healthcare & Medicine > Clinical > Medical Imaging
Deep Learning > Learning Types > Multi-Modal Learning
Computer Vision > Applications > Medical Imaging