2024
EMNLP
EMNLP 2024
Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes
Abstract
AbstractWe tackle societal bias in image-text datasets by removing spurious correlations between protected groups and image attributes. Traditional methods only target labeled attributes, ignoring biases from unlabeled ones. Using text-guided inpainting models, our approach ensures protected group independence from all attributes and mitigates inpainting biases through data filtering. Evaluations on multi-label image classification and image captioning tasks show our method effectively reduces bias without compromising performance across various models. Specifically, we achieve an average societal bias reduction of 46.1% in leakage-based bias metrics for multi-label classification and 74.8% for image captioning.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Computer Vision and Deep Learning and Machine Learning
🧭
Keyword Pioneer
— societal bias mitigation
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Core Methods > Classification
Machine Learning > Application Areas > Fairness
Computer Vision > Generation > Image Captioning
Computer Vision > Processing > Image Restoration
Artificial Intelligence > Core AI > Fairness
Machine Learning > Learning Types > Fairness
Deep Learning > Learning Types > Multi-Modal Learning