Efficient Multi-label Classification with Many Labels

Wei Bi; James Kwok

2013 ICML ICML 2013

Efficient Multi-label Classification with Many Labels

Abstract

Multi-label classification deals with the problem where each instance can be associated with a set of class labels. However, in many real-world applications, the number of class labels can be in the hundreds or even thousands, and existing multi-label classification methods often become computationally inefficient. In recent years, a number of remedies have been proposed. However, they are either based on simple dimension reduction techniques or involve expensive optimization problems. In this paper, we address this problem by selecting a small subset of class labels that can approximately span the original label space. This is performed by randomized sampling where the sampling probability of each class label reflects its importance among all the labels. Theoretical analysis shows that this randomized sampling approach is highly efficient. Experiments on a number of real-world multi-label datasets with many labels demonstrate the appealing performance and efficiency of the proposed algorithm.

🚀 Conference Pioneer — ICML 2013

🧭 Keyword Pioneer — randomized sampling

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio

📈 Trend Setter — Multi-Label Classification

🐣 Hot Topic Early Bird — multi-label classification

Authors

Wei Bi , James Kwok

Topics

Machine Learning > Core Methods > Classification Machine Learning > Application Areas > Efficient Computing Machine Learning > Core Methods > Dimensionality Reduction Machine Learning > Learning Types > Multi-Label Classification

Keywords

dimensionality reduction multi-label classification label embedding randomized sampling label space reduction high-dimensional label

Download PDF

Related papers

Convex Adversarial Collective Classification 2013

Gaussian Process Vine Copulas for Multivariate Dependence 2013

Stochastic Simultaneous Optimistic Optimization 2013

Generic Exploration and K-armed Voting Bandits 2013

Robust Structural Metric Learning 2013