Learning Taxonomy Adaptation in Large-scale Classification

Rohit Babbar; Ioannis Partalas; Eric Gaussier; Massih-Reza Amini; Cécile Amblard

2016 JMLR JMLR 2016

Learning Taxonomy Adaptation in Large-scale Classification

Abstract

In this paper, we study flat and hierarchical classification strategies in the context of large-scale taxonomies. Addressing the problem from a learning-theoretic point of view, we first propose a multi-class, hierarchical data dependent bound on the generalization error of classifiers deployed in large-scale taxonomies. This bound provides an explanation to several empirical results reported in the literature, related to the performance of flat and hierarchical classifiers. Based on this bound, we also propose a technique for modifying a given taxonomy through pruning, that leads to a lower value of the upper bound as compared to the original taxonomy. We then present another method for hierarchy pruning by studying approximation error of a family of classifiers, and derive from it features used in a meta-classifier to decide which nodes to prune. We finally illustrate the theoretical developments through several experiments conducted on two widely used taxonomies. [abs] [ pdf ][ bib ] © JMLR 2016. (edit, beta)

🧭 Keyword Pioneer — hierarchy pruning

🐣 Hot Topic Early Bird — generalization bound

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning

Authors

Rohit Babbar , Ioannis Partalas , Eric Gaussier , Massih-Reza Amini , Cécile Amblard

Topics

Machine Learning > Core Methods > Classification Machine Learning > Optimization & Theory > Learning Theory Machine Learning > Learning Paradigms > Multi-Task Learning

Keywords

hierarchical classification multi-class classification generalization bound hierarchy pruning taxonomy adaptation

Download PDF

Related papers

Trend Filtering on Graphs 2016

Causal Inference through a Witness Protection Program 2016

A Characterization of Linkage-Based Hierarchical Clustering 2016

How to Center Deep Boltzmann Machines 2016

Minimax Rates in Permutation Estimation for Feature Matching 2016