Overview of the MedVidQA 2022 Shared Task on Medical Video Question-Answering

Deepak Gupta; Dina Demner-Fushman

ACL 2022

Abstract

In this paper, we present an overview of the MedVidQA 2022 shared task, co-located with the 21st BioNLP workshop at ACL 2022. The shared task addressed two of the challenges faced by medical video question answering: (i) a video classification task that explores new approaches to medical video understanding (labeling), and (ii) a visual answer localization task. Visual answer localization refers to identifying the relevant temporal segment (start and end timestamps) in the video where the answer to the medical question is shown or illustrated. Thirteen teams participated in the shared task challenges, and eleven system descriptions were submitted to the workshop. The descriptions present mono-modal and multi-modal approaches developed for medical video classification and visual answer localization. This paper describes the tasks, datasets, evaluation metrics, and baseline systems for both tasks. Finally, it summarizes the techniques explored by the participating teams and the results of their evaluation.
