InverseFaceNet: Deep Monocular Inverse Face Rendering

Hyeongwoo Kim; Michael Zollhöfer; Ayush Tewari; Justus Thies; Christian Richardt; Christian Theobalt

CVPR 2018

Abstract

We introduce InverseFaceNet, a deep convolutional inverse rendering framework for faces that jointly estimates facial pose, shape, expression, reflectance and illumination from a single input image. By estimating all parameters from just a single image, advanced editing possibilities on a single face image, such as appearance editing and relighting, become feasible in real time. Most previous learning-based face reconstruction approaches do not jointly recover all dimensions, or are severely limited in terms of visual quality. In contrast, we propose to recover high-quality facial pose, shape, expression, reflectance and illumination using a deep neural network that is trained using a large, synthetically created training corpus. Our approach builds on a novel loss function that measures model-space similarity directly in parameter space and significantly improves reconstruction accuracy. We further propose a self-supervised bootstrapping process in the network training loop, which iteratively updates the synthetic training corpus to better reflect the distribution of real-world imagery. We demonstrate that this strategy outperforms completely synthetically trained networks. Finally, we show high-quality reconstructions and compare our approach to several state-of-the-art approaches.

Topics

Artificial Intelligence > Core AI > Multimodal Learning; Machine Learning > Core Methods > Representation Learning; Machine Learning > Learning Types > Self-Supervised Learning; Computer Vision > Analysis > Face Recognition; Deep Learning > Learning Types > Deep Learning; Computer Vision > Core AI > Computer Vision; Computer Vision > Domain-Specific > 3D Vision

Keywords

computer vision; self-supervised learning; convolutional neural network; deep convolutional neural network; synthetic training; face reconstruction; deep convolutional network; inverse rendering; appearance editing; facial reflectance; image rendering; inverse face rendering; facial parameter estimation; self-supervised bootstrapping; synthetic training corpus

Related papers

Multi-Shot Pedestrian Re-Identification via Sequential Decision Making 2018

Multi-Cue Correlation Filters for Robust Visual Tracking 2018

Pointwise Convolutional Neural Networks 2018

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking 2018

Image Generation From Scene Graphs 2018