Learning the Depths of Moving People by Watching Frozen People

Zhengqi Li; Tali Dekel; Forrester Cole; Richard Tucker; Noah Snavely; Ce Liu; William T. Freeman

2019 CVPR CVPR 2019

Learning the Depths of Moving People by Watching Frozen People

Abstract

We present a method for predicting dense depth in scenarios where both a monocular camera and people in the scene are freely moving. Existing methods for recovering depth for dynamic, non-rigid objects from monocular video impose strong assumptions on the objects' motion and may only recover sparse depth. In this paper, we take a data-driven approach and learn human depth priors from a new source of data: thousands of Internet videos of people imitating mannequins, i.e., freezing in diverse, natural poses, while a hand-held camera tours the scene. Since the people are stationary, training data can be created from these videos using multi-view stereo reconstruction. At inference time, our method uses motion parallax cues from the static areas of the scenes, and shows clear improvement over state-of-the-art monocular depth prediction methods. We demonstrate our method on real-world sequences of complex human actions captured by a moving hand-held camera, and show various 3D effects produced using our predicted depth.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🧭 Keyword Pioneer — monocular depth prediction

🐣 Hot Topic Early Bird — multi-view stereo

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zhengqi Li , Tali Dekel , Forrester Cole , Richard Tucker , Noah Snavely , Ce Liu , William T. Freeman

Topics

Computer Vision > Analysis > 3D Vision Computer Vision > Analysis > Depth Estimation Deep Learning > Learning Types > Deep Learning Computer Vision > Processing > Depth Estimation

Keywords

3d reconstruction depth estimation multi-view stereo monocular depth monocular depth prediction monocular camera motion parallax human depth human depth prior

Download PDF

Related papers

Fast Single Image Reflection Suppression via Convex Optimization 2019

Learning Video Representations From Correspondence Proposals 2019

ATOM: Accurate Tracking by Overlap Maximization 2019

Visual Tracking via Adaptive Spatially-Regularized Correlation Filters 2019

Edge-Labeling Graph Neural Network for Few-Shot Learning 2019