On Flat versus Hierarchical Classification in Large-Scale Taxonomies

Rohit Babbar; Ioannis Partalas; Eric Gaussier; Massih R. Amini

2013 NIPS NeurIPS 2013

On Flat versus Hierarchical Classification in Large-Scale Taxonomies

Abstract

We study in this paper flat and hierarchical classification strategies in the context of large-scale taxonomies. To this end, we first propose a multiclass, hierarchical data dependent bound on the generalization error of classifiers deployed in large-scale taxonomies. This bound provides an explanation to several empirical results reported in the literature, related to the performance of flat and hierarchical classifiers. We then introduce another type of bounds targeting the approximation error of a family of classifiers, and derive from it features used in a meta-classifier to decide which nodes to prune (or flatten) in a large-scale taxonomy. We finally illustrate the theoretical developments through several experiments conducted on two widely used taxonomies.

🌉 Interdisciplinary Bridge — Data Science & Analytics and Machine Learning

📈 Trend Setter — Data Mining

🧭 Keyword Pioneer — taxonomy pruning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning

🐣 Hot Topic Early Bird — hierarchical classification

Authors

Rohit Babbar , Ioannis Partalas , Eric Gaussier , Massih R. Amini

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Regression Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Optimization & Theory > Theory Machine Learning > Application Areas > Domain Adaptation Data Science & Analytics > Methods > Data Mining Machine Learning > Learning Types > Classification Machine Learning > Learning Types > Multi-Class Classification Machine Learning > Core Methods > Evaluation Machine Learning > Optimization & Theory > Generalization

Keywords

hierarchical classification generalization error approximation error generalization error bound taxonomy pruning meta-classifier large-scale taxonomies generalization bound flat classification

Download PDF

Related papers

Latent Structured Active Learning 2013

Generalized Method-of-Moments for Rank Aggregation 2013

Third-Order Edge Statistics: Contour Continuation, Curvature, and Cortical Connections 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent 2013

Action is in the Eye of the Beholder: Eye-gaze Driven Model for Spatio-Temporal Action Localization 2013