LCD: Learned Cross-Domain Descriptors for 2D-3D Matching

Quang-Hieu Pham; Mikaela Angelina Uy; Binh-Son Hua; Duc Thanh Nguyen; Gemma Roig; Sai-Kit Yeung

AAAI 2020

Abstract

In this work, we present a novel method to learn a local cross-domain descriptor for 2D image and 3D point cloud matching. Our proposed method is a dual auto-encoder neural network that maps 2D and 3D inputs into a shared latent space representation. We show that such local cross-domain descriptors in the shared embedding are more discriminative than those obtained from individual training in the 2D and 3D domains. To facilitate the training process, we built a new dataset by collecting approximately 1.4 million 2D-3D correspondences with various lighting conditions and settings from publicly available RGB-D scenes. Our descriptor is evaluated in three main experiments: 2D-3D matching, cross-domain retrieval, and sparse-to-dense depth estimation. Experimental results confirm the robustness of our approach as well as its competitive performance, not only in solving cross-domain tasks but also in generalizing to purely 2D and purely 3D tasks. Our dataset and code are released publicly at https://hkust-vgd.github.io/lcd.
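To illustrate how descriptors in a shared latent space could be used for 2D-3D matching, the following minimal sketch pairs each 2D patch descriptor with its nearest 3D descriptor by Euclidean distance. The function names, the toy 3-dimensional descriptors, and the brute-force search are assumptions made for illustration only; they are not taken from the released code, and the learned descriptors would come from the trained encoders rather than being written by hand.

```python
import math
from typing import List, Sequence, Tuple


def l2_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean distance between two descriptors of equal length."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def match_2d_to_3d(
    desc_2d: Sequence[Sequence[float]],
    desc_3d: Sequence[Sequence[float]],
) -> List[Tuple[int, int, float]]:
    """For each 2D descriptor, find the closest 3D descriptor.

    Returns a list of (index_2d, index_3d, distance) tuples.
    Brute-force search is used here for clarity (hypothetical helper).
    """
    matches = []
    for i, d2 in enumerate(desc_2d):
        distances = [l2_distance(d2, d3) for d3 in desc_3d]
        j = min(range(len(distances)), key=distances.__getitem__)
        matches.append((i, j, distances[j]))
    return matches


# Toy descriptors standing in for encoder outputs (assumed values).
image_descriptors = [[0.9, 0.1, 0.0], [0.0, 0.2, 0.8]]
point_cloud_descriptors = [[0.0, 0.0, 1.0], [1.0, 0.0, 0.0], [0.5, 0.5, 0.0]]

matches = match_2d_to_3d(image_descriptors, point_cloud_descriptors)
for idx_2d, idx_3d, dist in matches:
    print(f"2D patch {idx_2d} -> 3D patch {idx_3d} (distance {dist:.3f})")
```

Because both domains are embedded in the same space, a single distance function suffices for cross-domain comparison; for large descriptor sets, an approximate nearest-neighbor index would likely replace the brute-force loop.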

Topics

Machine Learning > Core Methods > Embedding Learning

Deep Learning > Architectures > Autoencoders

Computer Vision > Analysis > 3D Vision

Keywords

point cloud, latent space, 2D-3D matching, autoencoder network, cross-domain descriptor
