Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval

Fan Yang; Zheng Wang; Jing Xiao; Shin'ichi Satoh

2020 AAAI AAAI 2020

Mining on Heterogeneous Manifolds for Zero-Shot Cross-Modal Image Retrieval

Abstract

Abstract Most recent approaches for the zero-shot cross-modal image retrieval map images from different modalities into a uniform feature space to exploit their relevance by using a pre-trained model. Based on the observation that manifolds of zero-shot images are usually deformed and incomplete, we argue that the manifolds of unseen classes are inevitably distorted during the training of a two-stream model that simply maps images from different modalities into a uniform space. This issue directly leads to poor cross-modal retrieval performance. We propose a bi-directional random walk scheme to mining more reliable relationships between images by traversing heterogeneous manifolds in the feature space of each modality. Our proposed method benefits from intra-modal distributions to alleviate the interference caused by noisy similarities in the cross-modal feature space. As a result, we achieved great improvement in the performance of the thermal v.s. visible image retrieval task. The code of this paper: https://github.com/fyang93/cross-modal-retrieval

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🐣 Hot Topic Early Bird — feature space

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Fan Yang , Zheng Wang , Jing Xiao , Shin'ichi Satoh

Topics

Machine Learning > Core Methods > Metric Learning Machine Learning > Learning Types > Zero-Shot Learning Computer Vision > Analysis > Object Detection Deep Learning > Learning Types > Transfer Learning Deep Learning > Learning Types > Zero-Shot Learning Computer Vision > Analysis > Image Retrieval

Keywords

zero-shot learning metric learning manifold learning cross-modal retrieval random walk feature space

Download PDF

Related papers

Enhancing Pointer Network for Sentence Ordering with Pairwise Ordering Predictions 2020

CopyMTL: Copy Mechanism for Joint Extraction of Entities and Relations with Multi-Task Learning 2020

Neural Simile Recognition with Cyclic Multitask Learning and Local Attention 2020

Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy 2020

Multi-Point Semantic Representation for Intent Classification 2020