Learning to Align Semantic Segmentation and 2.5D Maps for Geolocalization

Anil Armagan; Martin Hirzer; Peter M. Roth; Vincent Lepetit

CVPR 2017

Abstract

We present an efficient method for geolocalization in urban environments, starting from a coarse location estimate provided by GPS and using a simple untextured 2.5D model of the surrounding buildings. Our key contribution is a novel, efficient, and robust method to optimize the pose: we train a deep network to predict the best direction in which to improve a pose estimate, given a semantic segmentation of the input image and a rendering of the buildings from this estimate. We then apply this CNN iteratively until the estimate converges to a good pose. This approach avoids the use of reference images of the surroundings, which are difficult to acquire and match, whereas 2.5D models are broadly available. We can therefore apply it to places unseen during training.
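The iterative refinement described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the 3-DoF pose, the discrete set of update directions, the fixed step sizes, and the names `refine_pose`, `DIRECTIONS`, and `make_toy_predictor` are all assumptions made for illustration. In the actual method the predictor would be the trained network, fed with the semantic segmentation and a rendering of the 2.5D map from the current pose; here a toy function that knows the target pose stands in for it so that the loop can run.

```python
from typing import Callable, Tuple

# Simplified 3-DoF pose (x, y, heading). This parameterization is an assumption.
Pose = Tuple[float, float, float]

# Hypothetical discrete update directions: "stay", or +/- one step along one axis.
DIRECTIONS = [
    (0, 0, 0),
    (1, 0, 0), (-1, 0, 0),
    (0, 1, 0), (0, -1, 0),
    (0, 0, 1), (0, 0, -1),
]


def refine_pose(
    initial: Pose,
    predict_direction: Callable[[Pose], int],
    step: Pose = (1.0, 1.0, 1.0),
    max_iters: int = 100,
) -> Pose:
    """Repeatedly move the pose in the predicted direction until "stay" is predicted."""
    pose = initial
    for _ in range(max_iters):
        direction = DIRECTIONS[predict_direction(pose)]
        if direction == (0, 0, 0):
            break  # the predictor considers the pose good enough
        pose = tuple(p + s * d for p, s, d in zip(pose, step, direction))
    return pose


def make_toy_predictor(target: Pose) -> Callable[[Pose], int]:
    """Stand-in for the trained network (hypothetical): it cheats by knowing the target."""
    def predict(pose: Pose) -> int:
        errors = [t - p for t, p in zip(target, pose)]
        # Pick the axis with the largest remaining error.
        axis = max(range(3), key=lambda i: abs(errors[i]))
        if abs(errors[axis]) < 0.5:
            return 0  # index of the "stay" direction
        sign = 1 if errors[axis] > 0 else -1
        return DIRECTIONS.index(tuple(sign if i == axis else 0 for i in range(3)))
    return predict


if __name__ == "__main__":
    coarse_gps_pose = (0.0, 0.0, 0.0)
    predictor = make_toy_predictor(target=(3.0, -2.0, 5.0))
    print(refine_pose(coarse_gps_pose, predictor))
```

Separating the update loop from the predictor mirrors the structure the abstract describes: the loop needs no reference images, only a function that maps the current estimate to a direction of improvement.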

Topics

Machine Learning > Application Areas > Domain Adaptation
Computer Vision > Analysis > Scene Understanding

Keywords

semantic segmentation, pose estimation, deep network, 2.5D reconstruction

Related papers

Deep Outdoor Illumination Estimation 2017

SRN: Side-output Residual Network for Object Symmetry Detection in the Wild 2017

Weakly Supervised Semantic Segmentation Using Web-Crawled Videos 2017

FASON: First and Second Order Information Fusion Network for Texture Recognition 2017

Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization 2017