Recurrent MVSNet for High-Resolution Multi-View Stereo Depth Inference

Yao Yao; Zixin Luo; Shiwei Li; Tianwei Shen; Tian Fang; Long Quan

CVPR 2019

Abstract

Deep learning has recently demonstrated excellent performance for multi-view stereo (MVS). However, one major limitation of current learned MVS approaches is scalability: the memory-consuming cost volume regularization makes learned MVS hard to apply to high-resolution scenes. In this paper, we introduce a scalable multi-view stereo framework based on the recurrent neural network. Instead of regularizing the entire 3D cost volume in one go, the proposed Recurrent Multi-view Stereo Network (R-MVSNet) sequentially regularizes the 2D cost maps along the depth direction via the gated recurrent unit (GRU). This dramatically reduces memory consumption and makes high-resolution reconstruction feasible. We first show the state-of-the-art performance achieved by the proposed R-MVSNet on recent MVS benchmarks. We then demonstrate the scalability of the proposed method on several large-scale scenarios, where previous learned approaches often fail due to memory constraints. Code is available at https://github.com/YoYo000/MVSNet.
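
To make the idea of sequential regularization concrete, the following is a minimal NumPy sketch and not the authors' implementation. It assumes a per-pixel GRU cell (a 1x1 stand-in for a convolutional GRU), random untrained weights, and random cost maps; the names `PixelwiseGRU`, `regularize_sequentially`, and `w_out` are hypothetical. It illustrates that only one 2D cost map and one hidden state need to be held while sweeping along the depth direction, rather than a full 3D cost volume with feature channels.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class PixelwiseGRU:
    """GRU cell applied independently at every pixel.

    Assumption: a 1x1 stand-in for a convolutional GRU, with random weights.
    """

    def __init__(self, in_ch, hid_ch, rng):
        shape = (in_ch + hid_ch, hid_ch)  # maps concat([x, h]) -> hidden
        self.Wz = rng.normal(0.0, 0.1, shape)  # update gate weights
        self.Wr = rng.normal(0.0, 0.1, shape)  # reset gate weights
        self.Wh = rng.normal(0.0, 0.1, shape)  # candidate state weights

    def step(self, x, h):
        xh = np.concatenate([x, h], axis=-1)
        z = sigmoid(xh @ self.Wz)  # update gate
        r = sigmoid(xh @ self.Wr)  # reset gate
        xrh = np.concatenate([x, r * h], axis=-1)
        h_cand = np.tanh(xrh @ self.Wh)
        return (1.0 - z) * h + z * h_cand


def regularize_sequentially(cost_maps, cell, w_out, hid_ch):
    """Sweep over depth hypotheses, one (H, W, C) cost map at a time."""
    h = None
    scores = []
    for cost in cost_maps:
        if h is None:
            h = np.zeros(cost.shape[:2] + (hid_ch,))
        h = cell.step(cost, h)      # hidden state carries context across depths
        scores.append(h @ w_out)    # one scalar score per pixel, shape (H, W)
    scores = np.stack(scores, axis=-1)             # (H, W, D)
    scores -= scores.max(axis=-1, keepdims=True)   # numerically stable softmax
    prob = np.exp(scores)
    prob /= prob.sum(axis=-1, keepdims=True)
    return prob


rng = np.random.default_rng(0)
H, W, C, D, HID = 4, 5, 8, 6, 3
cell = PixelwiseGRU(C, HID, rng)
w_out = rng.normal(0.0, 0.1, HID)
# A generator, so cost maps are produced and consumed one depth at a time.
cost_maps = (rng.normal(size=(H, W, C)) for _ in range(D))
prob = regularize_sequentially(cost_maps, cell, w_out, HID)
depth_index = prob.argmax(axis=-1)  # most likely depth hypothesis per pixel
```

In this sketch the only tensor that grows with the number of depth hypotheses is the single-channel score stack; the multi-channel cost maps and hidden state are held for one depth at a time.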

Topics

Deep Learning > Architectures > Neural Networks

Deep Learning > Models > Generative Models

Computer Vision > Analysis > 3D Vision

Artificial Intelligence > Core AI > Computer Vision

Deep Learning > Learning Types > Deep Learning

Deep Learning > Architectures > Recurrent Neural Networks

Computer Vision > Processing > Depth Estimation

Keywords

3D reconstruction, recurrent neural network, multi-view stereo, gated recurrent unit, cost volume regularization, high-resolution reconstruction, depth inference
