Cross-Modality Earth Mover’s Distance for Visible Thermal Person Re-identification

Yongguo Ling; Zhun Zhong; Zhiming Luo; Fengxiang Yang; Donglin Cao; Yaojin Lin; Shaozi Li; Nicu Sebe

2023 AAAI AAAI 2023

Cross-Modality Earth Mover’s Distance for Visible Thermal Person Re-identification

Abstract

Abstract Visible thermal person re-identification (VT-ReID) suffers from inter-modality discrepancy and intra-identity variations. Distribution alignment is a popular solution for VT-ReID, however, it is usually restricted to the influence of the intra-identity variations. In this paper, we propose the Cross-Modality Earth Mover's Distance (CM-EMD) that can alleviate the impact of the intra-identity variations during modality alignment. CM-EMD selects an optimal transport strategy and assigns high weights to pairs that have a smaller intra-identity variation. In this manner, the model will focus on reducing the inter-modality discrepancy while paying less attention to intra-identity variations, leading to a more effective modality alignment. Moreover, we introduce two techniques to improve the advantage of CM-EMD. First, Cross-Modality Discrimination Learning (CM-DL) is designed to overcome the discrimination degradation problem caused by modality alignment. By reducing the ratio between intra-identity and inter-identity variances, CM-DL leads the model to learn more discriminative representations. Second, we construct the Multi-Granularity Structure (MGS), enabling us to align modalities from both coarse- and fine-grained levels with the proposed CM-EMD. Extensive experiments show the benefits of the proposed CM-EMD and its auxiliary techniques (CM-DL and MGS). Our method achieves state-of-the-art performance on two VT-ReID benchmarks.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — visible thermal person re-identification

🐣 Hot Topic Early Bird — modality alignment

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yongguo Ling , Zhun Zhong , Zhiming Luo , Fengxiang Yang , Donglin Cao , Yaojin Lin , Shaozi Li , Nicu Sebe

Topics

Artificial Intelligence > Core AI > Multimodal Learning Machine Learning > Core Methods > Metric Learning Machine Learning > Optimization & Theory > Optimization Computer Vision > Analysis > Person Re-Identification Computer Vision > Core AI > Multimodal Learning Deep Learning > Learning Types > Metric Learning

Keywords

feature learning optimal transport feature matching person re-identification modality alignment distribution alignment cross-modality learning earth mover distance discriminative representation cross-modal matching visible thermal person re-identification intra-identity variation cross-modality discrepancy

Download PDF

Related papers

A Model-Agnostic Heuristics for Selective Classification 2023

Tackling Safe and Efficient Multi-Agent Reinforcement Learning via Dynamic Shielding (Student Abstract) 2023

Head-Free Lightweight Semantic Segmentation with Linear Transformer 2023

Hierarchical ConViT with Attention-Based Relational Reasoner for Visual Analogical Reasoning 2023

Deep Spiking Neural Networks with High Representation Similarity Model Visual Pathways of Macaque and Mouse 2023