Self-Supervised 3D Keypoint Learning for Ego-Motion Estimation

Jiexiong Tang; Rares Ambrus; Vitor Guizilini; Sudeep Pillai; Hanme Kim; Patric Jensfelt; Adrien Gaidon

CoRL 2020

Abstract

Detecting and matching robust viewpoint-invariant keypoints is critical for visual SLAM and Structure-from-Motion. State-of-the-art learning-based methods generate training samples via homography adaptation, which creates synthetic 2D views with known keypoint matches from a single image. This approach does not, however, generalize to non-planar 3D scenes with the illumination variations commonly seen in real-world videos. In this work, we propose to learn depth-aware keypoints directly from unlabeled videos in a self-supervised manner. We jointly learn keypoint and depth estimation networks by combining appearance and geometric matching via a differentiable structure-from-motion module based on Procrustean residual pose correction. We show how our self-supervised keypoints can be trivially incorporated into state-of-the-art visual odometry frameworks for robust and accurate ego-motion estimation of autonomous vehicles in real-world conditions.
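To make the "Procrustean" pose step more concrete, the sketch below shows the classic closed-form orthogonal Procrustes (Kabsch) alignment between two sets of matched 3D points. It is only an illustration of the general technique, not the authors' module: the abstract does not specify the implementation, and the function name `procrustes_pose`, the unweighted least-squares formulation, and the synthetic test data are all assumptions. In a setting like the one described, the matched points would plausibly be keypoints lifted to 3D using predicted depth.

```python
import numpy as np


def procrustes_pose(src, dst):
    """Least-squares rigid transform (R, t) such that dst ~= src @ R.T + t.

    src, dst: (N, 3) arrays of matched 3D points (hypothetical inputs,
    e.g. keypoints lifted to 3D with predicted depth in two frames).
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    src_mean = src.mean(axis=0)
    dst_mean = dst.mean(axis=0)
    # 3x3 cross-covariance of the centered point sets.
    H = (src - src_mean).T @ (dst - dst_mean)
    U, _, Vt = np.linalg.svd(H)
    # Flip the last axis if needed so det(R) = +1 (rotation, not reflection).
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_mean - R @ src_mean
    return R, t


# Synthetic check (assumed data): rotate about z by 0.1 rad and translate.
rng = np.random.default_rng(0)
points = rng.normal(size=(50, 3))
angle = 0.1
R_true = np.array([
    [np.cos(angle), -np.sin(angle), 0.0],
    [np.sin(angle), np.cos(angle), 0.0],
    [0.0, 0.0, 1.0],
])
t_true = np.array([0.5, -0.2, 1.0])
moved = points @ R_true.T + t_true

R_est, t_est = procrustes_pose(points, moved)
```

Every operation here (means, a matrix product, an SVD, a determinant) is differentiable almost everywhere, which is what makes this kind of closed-form solve a natural fit for an end-to-end trainable pipeline. A practical system would likely also weight or filter matches to handle outliers, which this sketch omits.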

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Learning Types > Self-Supervised Learning
Computer Vision > Processing > Video Understanding

Keywords

self-supervised learning, depth estimation, structure from motion, visual odometry, keypoint detection, ego-motion estimation
