SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation

Lijun Wang; Jianming Zhang; Oliver Wang; Zhe Lin; Huchuan Lu

2020 CVPR CVPR 2020

SDC-Depth: Semantic Divide-and-Conquer Network for Monocular Depth Estimation

Abstract

Monocular depth estimation is an ill-posed problem, and as such critically relies on scene priors and semantics. Due to its complexity, we propose a deep neural network model based on a semantic divide-and-conquer approach. Our model decomposes a scene into semantic segments, such as object instances and background stuff classes, and then predicts a scale and shift invariant depth map for each semantic segment in a canonical space. Semantic segments of the same category share the same depth decoder, so the global depth prediction task is decomposed into a series of category-specific ones, which are simpler to learn and easier to generalize to new scene types. Finally, our model stitches each local depth segment by predicting its scale and shift based on the global context of the image. The model is trained end-to-end using a multi-task loss for panoptic segmentation and depth prediction, and is therefore able to leverage large-scale panoptic segmentation datasets to boost its semantic understanding. We validate the effectiveness of our approach and show state-of-the-art performance on three benchmark datasets.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🐣 Hot Topic Early Bird — monocular depth estimation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Lijun Wang , Jianming Zhang , Oliver Wang , Zhe Lin , Huchuan Lu

Topics

Deep Learning > Techniques > Model Architecture Computer Vision > Analysis > Depth Estimation Computer Vision > Processing > Semantic Segmentation Deep Learning > Learning Types > Multi-Task Learning Computer Vision > Processing > Depth Estimation

Keywords

semantic segmentation multi-task learning scene understanding depth estimation monocular depth estimation divide and conquer panoptic segmentation monocular depth

Download PDF

Related papers

Deep Polarization Cues for Transparent Object Segmentation 2020

HRank: Filter Pruning Using High-Rank Feature Map 2020

Panoptic-Based Image Synthesis 2020

Select, Supplement and Focus for RGB-D Saliency Detection 2020

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings 2020