RankMI: A Mutual Information Maximizing Ranking Loss

Mete Kemertas; Leila Pishdad; Konstantinos G. Derpanis; Afsaneh Fazly

CVPR 2020

Abstract

We introduce an information-theoretic loss function, RankMI, and an associated training algorithm for deep representation learning for image retrieval. Our proposed framework consists of alternating updates to a network that estimates the divergence between distance distributions of matching and non-matching pairs of learned embeddings, and an embedding network that maximizes this estimate via sampled negatives. In addition, under this information-theoretic lens we draw connections between RankMI and commonly used ranking losses, e.g., triplet loss. We extensively evaluate RankMI on several standard image retrieval datasets, namely, CUB-200-2011, CARS-196, and Stanford Online Products. Our method achieves competitive results or significant improvements over previously reported results on all datasets.
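To make the idea of a divergence estimate over distance distributions concrete, here is a minimal sketch. It is not the authors' implementation. It assumes the Donsker-Varadhan lower bound on KL divergence as the estimator (the abstract does not say which bound is used) and replaces the learned statistics network with a hand-picked, hypothetical function `statistic`. The names `split_pair_distances` and `dv_lower_bound` are likewise illustrative.

```python
import math
from itertools import combinations


def split_pair_distances(embeddings, labels):
    """Return Euclidean distances of matching and non-matching pairs."""
    pos, neg = [], []
    for i, j in combinations(range(len(embeddings)), 2):
        d = math.dist(embeddings[i], embeddings[j])
        (pos if labels[i] == labels[j] else neg).append(d)
    return pos, neg


def statistic(d, scale=1.0, margin=1.0):
    # Hypothetical stand-in for a learned statistics network:
    # scores small distances high and large distances low.
    return scale * (margin - d)


def dv_lower_bound(pos, neg, t=statistic):
    """Donsker-Varadhan bound: E_pos[T(d)] - log E_neg[exp(T(d))].

    Assumed estimator for illustration; it lower-bounds the KL divergence
    between the matching and non-matching distance distributions.
    """
    mean_pos = sum(t(d) for d in pos) / len(pos)
    mean_exp_neg = sum(math.exp(t(d)) for d in neg) / len(neg)
    return mean_pos - math.log(mean_exp_neg)


labels = [0, 0, 1, 1]
# Classes form tight, well-separated clusters.
separated = [[0.0, 0.0], [0.1, 0.0], [3.0, 0.0], [3.1, 0.0]]
# Same-class points are far apart, different-class points are close.
mixed = [[0.0, 0.0], [3.0, 0.0], [0.1, 0.0], [3.1, 0.0]]

bound_separated = dv_lower_bound(*split_pair_distances(separated, labels))
bound_mixed = dv_lower_bound(*split_pair_distances(mixed, labels))
```

With this fixed statistic, the bound is larger for the embedding whose matching pairs are close and non-matching pairs are far. In the framework described above, one network would be trained to tighten such an estimate while the embedding network is updated to increase it; this sketch shows only the quantity being estimated, not the alternating training loop or the negative sampling.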

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Core Methods > Metric Learning
Computer Vision > Analysis > Object Detection
Machine Learning > Learning Types > Representation Learning
Machine Learning > Learning Types > Metric Learning
Deep Learning > Learning Types > Representation Learning
Computer Vision > Analysis > Image Retrieval

Keywords

representation learning, metric learning, image retrieval, embedding learning, mutual information, ranking loss, triplet loss
