Learning From Synthetic Humans

Gül Varol; Javier Romero; Xavier Martin; Naureen Mahmood; Michael J. Black; Ivan Laptev; Cordelia Schmid

CVPR 2017

Abstract

Estimating human pose, shape, and motion from images and video are fundamental challenges with many applications. Recent advances in 2D human pose estimation use large amounts of manually labeled training data for learning convolutional neural networks (CNNs). Such data is time-consuming to acquire and difficult to extend. Moreover, manual labeling of 3D pose, depth, and motion is impractical. In this work we present SURREAL: a new large-scale dataset with synthetically generated but realistic images of people rendered from 3D sequences of human motion capture data. We generate more than 6 million frames together with ground truth pose, depth maps, and segmentation masks. We show that CNNs trained on our synthetic dataset allow for accurate human depth estimation and human part segmentation in real RGB images. Our results and the new dataset open up new possibilities for advancing person analysis using cheap and large-scale synthetic data.

Topics

Deep Learning > Techniques > Pretraining
Computer Vision > Analysis > Human Analysis
Computer Vision > Analysis > Human Pose Estimation
Machine Learning > Learning Types > Supervised Learning
Deep Learning > Learning Types > Supervised Learning

Keywords

image segmentation, depth estimation, human motion capture, human pose estimation, motion capture, convolutional neural network, synthetic data, part segmentation, human segmentation
