DeepCap: Monocular Human Performance Capture Using Weak Supervision

Marc Habermann; Weipeng Xu; Michael Zollhöfer; Gerard Pons-Moll; Christian Theobalt

CVPR 2020

Abstract

Human performance capture is a highly important computer vision problem with many applications in movie production and virtual/augmented reality. Many previous performance capture approaches either required expensive multi-view setups or did not recover dense space-time coherent geometry with frame-to-frame correspondences. We propose a novel deep learning approach for monocular dense human performance capture. Our method is trained in a weakly supervised manner based on multi-view supervision, completely removing the need for training data with 3D ground truth annotations. The architecture consists of two separate networks that disentangle the task into a pose estimation step and a non-rigid surface deformation step. Extensive qualitative and quantitative evaluations show that our approach outperforms the state of the art in terms of quality and robustness.
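To make the idea of multi-view weak supervision concrete, the following is a minimal sketch of one common way such a training signal can be built: predicted 3D keypoints are projected into several calibrated cameras and compared against 2D detections, so no 3D ground truth is needed. The abstract does not specify the loss terms, so everything here is an illustrative assumption, not the authors' formulation: the function names (project, multiview_keypoint_loss), the 3x4 projection matrices, and the toy cameras are all hypothetical.

```python
from typing import List, Sequence, Tuple

Point3 = Tuple[float, float, float]
Point2 = Tuple[float, float]
# A 3x4 pinhole projection matrix stored as three rows of four numbers
# (assumed camera model, chosen for illustration only).
Camera = Sequence[Sequence[float]]


def project(camera: Camera, point: Point3) -> Point2:
    """Project a 3D point into an image using a 3x4 projection matrix."""
    x, y, z = point
    u, v, w = (row[0] * x + row[1] * y + row[2] * z + row[3] for row in camera)
    return (u / w, v / w)


def multiview_keypoint_loss(
    predicted_3d: List[Point3],
    cameras: List[Camera],
    detections_2d: List[List[Point2]],
) -> float:
    """Mean squared 2D reprojection error over all cameras and keypoints."""
    total = 0.0
    count = 0
    for camera, detections in zip(cameras, detections_2d):
        for point, (u_obs, v_obs) in zip(predicted_3d, detections):
            u, v = project(camera, point)
            total += (u - u_obs) ** 2 + (v - v_obs) ** 2
            count += 1
    return total / count


# Two hypothetical cameras: identity intrinsics, the second shifted along x.
cam_a = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]
cam_b = [[1, 0, 0, -1], [0, 1, 0, 0], [0, 0, 1, 0]]

# Synthetic "2D detections" generated from known 3D points.
truth = [(0.0, 0.0, 2.0), (1.0, 1.0, 4.0)]
detections = [[project(c, p) for p in truth] for c in (cam_a, cam_b)]

loss_correct = multiview_keypoint_loss(truth, [cam_a, cam_b], detections)

# A prediction that lands on the same pixels in camera A but is twice as far away.
wrong_depth = [(0.0, 0.0, 4.0), (2.0, 2.0, 8.0)]
loss_single_view = multiview_keypoint_loss(wrong_depth, [cam_a], detections[:1])
loss_multi_view = multiview_keypoint_loss(wrong_depth, [cam_a, cam_b], detections)
```

In this toy setup the wrong-depth prediction is indistinguishable from the correct one when only camera A is used, while adding camera B yields a non-zero loss. This illustrates why supervision from several views at training time can stand in for 3D annotations, even though inference uses a single image.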

Topics

Machine Learning > Learning Types > Weakly Supervised Learning
Computer Vision > Analysis > 3D Vision
Computer Vision > Analysis > Human Analysis
Computer Vision > Analysis > Human Pose Estimation
Deep Learning > Learning Types > Self-Supervised Learning
Computer Vision > Core AI > Computer Vision
Deep Learning > Learning Types > Weakly Supervised Learning
Computer Vision > Domain-Specific > Computer Graphics

Keywords

pose estimation, monocular depth estimation, weak supervision, monocular vision, non-rigid deformation, human performance capture, non-rigid surface deformation, multi-view supervision, monocular capture
