Synthesizing Images of Humans in Unseen Poses

Guha Balakrishnan; Amy Zhao; Adrian V. Dalca; Fredo Durand; John Guttag

CVPR 2018

Abstract

We address the computational problem of novel human pose synthesis. Given an image of a person and a desired pose, we produce a depiction of that person in that pose, retaining the appearance of both the person and background. We present a modular generative neural network that synthesizes unseen poses using training pairs of images and poses taken from human action videos. Our network separates a scene into different body part and background layers, moves body parts to new locations and refines their appearances, and composites the new foreground with a hole-filled background. These subtasks, implemented with separate modules, are trained jointly using only a single target image as a supervised label. We use an adversarial discriminator to force our network to synthesize realistic details conditioned on pose. We demonstrate image synthesis results on three action classes: golf, yoga/workouts and tennis, and show that our method produces accurate results within action classes as well as across action classes. Given a sequence of desired poses, we also produce coherent videos of actions.
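The compositing step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the blending formula, so the code below assumes a standard soft-mask blend, `mask * foreground + (1 - mask) * background`. The function name `composite`, the array shapes, and the toy values are all hypothetical and are not taken from the paper.

```python
import numpy as np

def composite(foreground, mask, background):
    """Blend a foreground image over a background using a soft mask.

    Assumed shapes (illustrative only): foreground and background are
    (H, W, 3) arrays, mask is an (H, W) array with values in [0, 1].
    """
    # Clip the mask to [0, 1] and add a channel axis so it broadcasts over RGB.
    m = np.clip(mask, 0.0, 1.0)[..., None]
    # Standard alpha blend: mask selects foreground, (1 - mask) selects background.
    return m * foreground + (1.0 - m) * background

# Toy 2x2 example: an all-ones foreground over an all-zeros background.
fg = np.ones((2, 2, 3))
bg = np.zeros((2, 2, 3))
mask = np.array([[1.0, 0.0],
                 [0.5, 0.25]])
out = composite(fg, mask, bg)
```

In this toy case each output pixel simply equals its mask value, since the foreground is 1 and the background is 0. In a learned system the mask and both images would presumably be network outputs, but that is an assumption beyond what the abstract states.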

Topics

Deep Learning > Models > Generative Models
Computer Vision > Analysis > Human Pose Estimation
Computer Vision > Generation > Image Generation
Computer Vision > Processing > Image Processing
Deep Learning > Learning Types > Generative Models

Keywords

image generation, person re-identification, adversarial training, generative adversarial network, pose transfer, adversarial discriminator, generative network, image composition, pose manipulation, human pose synthesis, modular generative network, action video synthesis
