Neural Head Reenactment with Latent Pose Descriptors

Egor Burkov; Igor Pasechnik; Artur Grigorev; Victor Lempitsky

2020 CVPR CVPR 2020

Neural Head Reenactment with Latent Pose Descriptors

Abstract

We propose a neural head reenactment system, which is driven by a latent pose representation and is capable of predicting the foreground segmentation alongside the RGB image. The latent pose representation is learned as a part of the entire reenactment system, and the learning process is based solely on image reconstruction losses. We show that despite its simplicity, with a large and diverse enough training dataset, such learning successfully decomposes pose from identity. The resulting system can then reproduce mimics of the driving person and, furthermore, can perform cross-person reenactment. Additionally, we show that the learned descriptors are useful for other pose-related tasks, such as keypoint prediction and pose-based retrieval.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Deep Learning

🧭 Keyword Pioneer — neural head reenactment

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Egor Burkov , Igor Pasechnik , Artur Grigorev , Victor Lempitsky

Topics

Computer Vision > Analysis > Face Recognition Computer Vision > Generation > Image Generation Artificial Intelligence > Core AI > Computer Vision Deep Learning > Learning Types > Representation Learning Computer Vision > Analysis > Pose Estimation

Keywords

image generation pose estimation face recognition image reconstruction latent representation keypoint prediction neural network neural head reenactment cross-person reenactment latent pose descriptor

Download PDF

Related papers

Deep Polarization Cues for Transparent Object Segmentation 2020

HRank: Filter Pruning Using High-Rank Feature Map 2020

Panoptic-Based Image Synthesis 2020

Select, Supplement and Focus for RGB-D Saliency Detection 2020

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings 2020