Towards Objectively Evaluating the Quality of Generated Medical Summaries

Francesco Moramarco; Damir Juric; Aleksandar Savkov; Ehud Reiter

EACL 2021

Abstract

We propose a method for evaluating the quality of generated text by asking evaluators to count facts and then computing precision, recall, F-score, and accuracy from the raw counts. We believe this approach leads to a more objective and more easily reproducible evaluation. We apply it to the task of medical report summarisation, where measuring quality and accuracy objectively is of paramount importance.
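As a rough illustration of how raw fact counts could be turned into scores, here is a minimal Python sketch. It is not the authors' implementation. The three count categories (`correct`, `incorrect`, `omitted`) and the names `FactCounts` and `fact_scores` are assumptions made for this example, and the formulas are the standard definitions of precision, recall, and F-score. Accuracy is left out because the abstract does not say how it is derived from the counts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FactCounts:
    """Hypothetical per-summary counts recorded by a human evaluator."""
    correct: int    # facts in the generated summary judged correct
    incorrect: int  # facts in the generated summary judged wrong or unsupported
    omitted: int    # facts in the source report that the summary leaves out


def fact_scores(counts: FactCounts) -> dict:
    """Compute precision, recall and F-score from raw fact counts.

    Assumed mapping: correct = true positives, incorrect = false positives,
    omitted = false negatives.
    """
    generated = counts.correct + counts.incorrect  # all facts the summary states
    expected = counts.correct + counts.omitted     # all facts it should state

    # Guard against empty summaries or empty source reports.
    precision = counts.correct / generated if generated else 0.0
    recall = counts.correct / expected if expected else 0.0

    denom = precision + recall
    f_score = 2 * precision * recall / denom if denom else 0.0

    return {"precision": precision, "recall": recall, "f_score": f_score}


if __name__ == "__main__":
    # Made-up counts for a single summary, for illustration only.
    print(fact_scores(FactCounts(correct=8, incorrect=2, omitted=4)))
```

Because the scores depend only on integer counts, two evaluators can compare their counts directly when their scores disagree, which is the kind of reproducibility the abstract argues for.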

Topics

Artificial Intelligence > Core AI > Interpretability
Machine Learning > Core Methods > Representation Learning
Machine Learning > Application Areas > Domain Adaptation
Natural Language Processing > Applications > Summarization
Natural Language Processing > Applications > Text Generation
Healthcare & Medicine > Clinical > Medical AI

Keywords

fact verification, natural language generation, text generation, text summarization, evaluation metrics, medical summarization, fact extraction, medical report, precision, recall
