One Size Fits None: Rethinking Fairness in Medical AI

Roland Roller; Michael Hahn; Ajay Madhavan Ravichandran; Bilgin Osmanodja; Florian Oetke; Zeineb Sassi; Aljoscha Burchardt; Klaus Netter; Klemens Budde; Anne Herrmann; Tobias Strapatsas; Peter Dabrock; Sebastian Möller

2025 ACL ACL 2025

One Size Fits None: Rethinking Fairness in Medical AI

Abstract

AbstractMachine learning (ML) models are increasingly used to support clinical decision-making. However, real-world medical datasets are often noisy, incomplete, and imbalanced, leading to performance disparities across patient subgroups. These differences raise fairness concerns, particularly when they reinforce existing disadvantages for marginalized groups. In this work, we analyze several medical prediction tasks and demonstrate how model performance varies with patient characteristics. While ML models may demonstrate good overall performance, we argue that subgroup-level evaluation is essential before integrating them into clinical workflows. By conducting a performance analysis at the subgroup level, differences can be clearly identified—allowing, on the one hand, for performance disparities to be considered in clinical practice, and on the other hand, for these insights to inform the responsible development of more effective models. Thereby, our work contributes to a practical discussion around the subgroup-sensitive development and deployment of medical ML models and the interconnectedness of fairness and transparency.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Healthcare & Medicine and Machine Learning

🧭 Keyword Pioneer — subgroup evaluation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Roland Roller , Michael Hahn , Ajay Madhavan Ravichandran , Bilgin Osmanodja , Florian Oetke , Zeineb Sassi , Aljoscha Burchardt , Klaus Netter , Klemens Budde , Anne Herrmann , Tobias Strapatsas , Peter Dabrock , Sebastian Möller

Topics

Machine Learning > Application Areas > Fairness Artificial Intelligence > Core AI > Fairness Healthcare & Medicine > Clinical > Medical AI Machine Learning > Learning Types > Fairness

Keywords

machine learning model performance clinical decision-making medical prediction performance disparity medical ai subgroup evaluation performance disparities

Download PDF

Graphically Speaking: Unmasking Abuse in Social Media with Conversation Insights 2025

CodeTool: Enhancing Programmatic Tool Invocation of LLMs via Process Supervision 2025

Structural Deep Encoding for Table Question Answering 2025

Vision-aided Unsupervised Constituency Parsing with Multi-MLLM Debating 2025

One Size Fits None: Rethinking Fairness in Medical AI

Abstract

Authors

Topics

Keywords

Related papers