LASER: LAtent SpacE Rendering for 2D Visual Localization

Zhixiang Min; Naji Khosravan; Zachary Bessinger; Manjunath Narayana; Sing Bing Kang; Enrique Dunn; Ivaylo Boyadzhiev

2022 CVPR CVPR 2022

LASER: LAtent SpacE Rendering for 2D Visual Localization

Abstract

We present LASER, an image-based Monte Carlo Localization (MCL) framework for 2D floor maps. LASER introduces the concept of latent space rendering, where 2D pose hypotheses on the floor map are directly rendered into a geometrically-structured latent space by aggregating viewing ray features. Through a tightly coupled rendering codebook scheme, the viewing ray features are dynamically determined at rendering-time based on their geometries (i.e. length, incident-angle), endowing our representation with view-dependent fine-grain variability. Our codebook scheme effectively disentangles feature encoding from rendering, allowing the latent space rendering to run at speeds above 10KHz. Moreover, through metric learning, our geometrically-structured latent space is common to both pose hypotheses and query images with arbitrary field of views. As a result, LASER achieves state-of-the-art performance on large-scale indoor localization datasets (i.e. ZInD and Structured3D) for both panorama and perspective image queries, while significantly outperforming existing learning-based methods in speed.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — monte carlo localization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zhixiang Min , Naji Khosravan , Zachary Bessinger , Manjunath Narayana , Sing Bing Kang , Enrique Dunn , Ivaylo Boyadzhiev

Topics

Machine Learning > Core Methods > Metric Learning Machine Learning > Application Areas > Domain Adaptation Computer Vision > Analysis > Scene Understanding Artificial Intelligence > Core AI > Computer Vision Deep Learning > Learning Types > Metric Learning

Keywords

metric learning pose estimation scene understanding visual localization monte carlo localization latent space rendering floor map

Download PDF

Related papers

UniCoRN: A Unified Conditional Image Repainting Network 2022

Why Discard if You Can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis 2022

All-in-One Image Restoration for Unknown Corruption 2022

Stability-Driven Contact Reconstruction From Monocular Color Images 2022

Forecasting Characteristic 3D Poses of Human Actions 2022