RGB-D Local Implicit Function for Depth Completion of Transparent Objects

Luyang Zhu; Arsalan Mousavian; Yu Xiang; Hammad Mazhar; Jozef van Eenbergen; Shoubhik Debnath; Dieter Fox

2021 CVPR CVPR 2021

RGB-D Local Implicit Function for Depth Completion of Transparent Objects

Abstract

Majority of the perception methods in robotics require depth information provided by RGB-D cameras. However, standard 3D sensors fail to capture depth of transparent objects due to refraction and absorption of light. In this paper, we introduce a new approach for depth completion of transparent objects from a single RGB-D image. Key to our approach is a local implicit neural representation built on ray-voxel pairs that allows our method to generalize to unseen objects and achieve fast inference speed. Based on this representation, we present a novel framework that can complete missing depth given noisy RGB-D input. We further improve the depth estimation iteratively using a self-correcting refinement model. To train the whole pipeline, we build a large scale synthetic dataset with transparent objects. Experiments demonstrate that our method performs significantly better than the current state-of-the-art methods on both synthetic and real world data. In addition, our approach improves the inference speed by a factor of 20 compared to the previous best method, ClearGrasp. Code will be released at https://research.nvidia.com/publication/2021-03_RGB-D-Local-Implicit.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning and Robotics

🧭 Keyword Pioneer — local implicit function

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Luyang Zhu , Arsalan Mousavian , Yu Xiang , Hammad Mazhar , Jozef van Eenbergen , Shoubhik Debnath , Dieter Fox

Topics

Machine Learning > Core Methods > Representation Learning Deep Learning > Architectures > Neural Networks Computer Vision > Analysis > Depth Estimation Robotics > Capabilities > Perception Computer Vision > Domain-Specific > Robotics Computer Vision > Processing > Depth Estimation

Keywords

depth estimation sensor fusion implicit neural representation depth completion neural implicit representation transparent object rgb-d imaging local implicit function ray-voxel pair self-correcting refinement

Download PDF

Related papers

Learning To Reconstruct High Speed and High Dynamic Range Videos From Events 2021

DeFLOCNet: Deep Image Editing via Flexible Low-Level Controls 2021

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs 2021

Coming Down to Earth: Satellite-to-Street View Synthesis for Geo-Localization 2021

Pose-Guided Human Animation From a Single Image in the Wild 2021