Dimensionality-Driven Learning with Noisy Labels

Xingjun Ma; Yisen Wang; Michael E. Houle; Shuo Zhou; Sarah Erfani; Shutao Xia; Sudanthi Wijewickrema; James Bailey

2018 ICML ICML 2018

Dimensionality-Driven Learning with Noisy Labels

Abstract

Datasets with significant proportions of noisy (incorrect) class labels present challenges for training accurate Deep Neural Networks (DNNs). We propose a new perspective for understanding DNN generalization for such datasets, by investigating the dimensionality of the deep representation subspace of training samples. We show that from a dimensionality perspective, DNNs exhibit quite distinctive learning styles when trained with clean labels versus when trained with a proportion of noisy labels. Based on this finding, we develop a new dimensionality-driven learning strategy, which monitors the dimensionality of subspaces during training and adapts the loss function accordingly. We empirically demonstrate that our approach is highly tolerant to significant proportions of noisy labels, and can effectively learn low-dimensional local subspaces that capture the data distribution.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🐣 Hot Topic Early Bird — loss function

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Xingjun Ma , Yisen Wang , Michael E. Houle , Shuo Zhou , Sarah Erfani , Shutao Xia , Sudanthi Wijewickrema , James Bailey

Topics

Machine Learning > Core Methods > Classification Machine Learning > Application Areas > Data Augmentation Deep Learning > Architectures > Neural Networks

Keywords

deep neural network loss function noisy label representation subspace

Download PDF

Related papers

Rectify Heterogeneous Models with Semantic Mapping 2018

Bayesian Optimization of Combinatorial Structures 2018

The Well-Tempered Lasso 2018

Approximation Algorithms for Cascading Prediction Models 2018

Classification from Pairwise Similarity and Unlabeled Data 2018