Learning a Tree of Metrics with Disjoint Visual Features

Kristen Grauman; Fei Sha; Sung Ju Hwang

2011 NIPS NeurIPS 2011

Learning a Tree of Metrics with Disjoint Visual Features

Abstract

We introduce an approach to learn discriminative visual representations while exploiting external semantic knowledge about object category relationships. Given a hierarchical taxonomy that captures semantic similarity between the objects, we learn a corresponding tree of metrics (ToM). In this tree, we have one metric for each non-leaf node of the object hierarchy, and each metric is responsible for discriminating among its immediate subcategory children. Specifically, a Mahalanobis metric learned for a given node must satisfy the appropriate (dis)similarity constraints generated only among its subtree members' training instances. To further exploit the semantics, we introduce a novel regularizer coupling the metrics that prefers a sparse disjoint set of features to be selected for each metric relative to its ancestor supercategory nodes' metrics. Intuitively, this reflects that visual cues most useful to distinguish the generic classes (e.g., feline vs. canine) should be different than those cues most useful to distinguish their component fine-grained classes (e.g., Persian cat vs. Siamese cat). We validate our approach with multiple image datasets using the WordNet taxonomy, show its advantages over alternative metric learning approaches, and analyze the meaning of attribute features selected by our algorithm.

🌉 Interdisciplinary Bridge — Computer Vision and Machine Learning

🧭 Keyword Pioneer — visual representation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

📈 Trend Setter — Computer Vision

🐣 Hot Topic Early Bird — metric learning

Authors

Kristen Grauman , Fei Sha , Sung Ju Hwang

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Core Methods > Metric Learning Computer Vision > Analysis > Object Detection Machine Learning > Learning Types > Multi-Task Learning Computer Vision > Core AI > Computer Vision

Keywords

metric learning feature selection visual representation fine-grained classification visual recognition mahalanobis distance hierarchical taxonomy mahalanobis metric disjoint feature learning tree of metrics disjoint feature selection discriminative model

Download PDF

Related papers

Co-Training for Domain Adaptation 2011

The Local Rademacher Complexity of Lp-Norm Multiple Kernel Learning 2011

Learning to Agglomerate Superpixel Hierarchies 2011

A Reinforcement Learning Theory for Homeostatic Regulation 2011

A Global Structural EM Algorithm for a Model of Cancer Progression 2011