Toward Robustness in Multi-Label Classification: A Data Augmentation Strategy against Imbalance and Noise

Hwanjun Song; Minseok Kim; Jae-Gil Lee

2024 AAAI AAAI 2024

Toward Robustness in Multi-Label Classification: A Data Augmentation Strategy against Imbalance and Noise

Abstract

Abstract Multi-label classification poses challenges due to imbalanced and noisy labels in training data. In this paper, we propose a unified data augmentation method, named BalanceMix, to address these challenges. Our approach includes two samplers for imbalanced labels, generating minority-augmented instances with high diversity. It also refines multi-labels at the label-wise granularity, categorizing noisy labels as clean, re-labeled, or ambiguous for robust optimization. Extensive experiments on three benchmark datasets demonstrate that BalanceMix outperforms existing state-of-the-art methods. We release the code at https://github.com/DISL-Lab/BalanceMix.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Hwanjun Song , Minseok Kim , Jae-Gil Lee

Topics

Machine Learning > Core Methods > Classification Machine Learning > Learning Types > Weakly Supervised Learning Machine Learning > Application Areas > Data Augmentation Machine Learning > Learning Types > Deep Learning Machine Learning > Learning Types > Multi-Label Classification Machine Learning > Learning Types > Data Augmentation Machine Learning > Learning Types > Robust Learning

Keywords

robust optimization data augmentation multi-label classification class imbalance label noise

Download PDF

Related papers

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI 2024

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Context Variables 2024

Suppressing Uncertainty in Gaze Estimation 2024

Mask-Homo: Pseudo Plane Mask-Guided Unsupervised Multi-Homography Estimation 2024

Heterogeneous Test-Time Training for Multi-Modal Person Re-identification 2024