Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data

Shichao Li; Lei Ke; Kevin Pratama; Yu-Wing Tai; Chi-Keung Tang; Kwang-Ting Cheng

2020 CVPR CVPR 2020

Cascaded Deep Monocular 3D Human Pose Estimation With Evolutionary Training Data

Abstract

End-to-end deep representation learning has achieved remarkable accuracy for monocular 3D human pose estimation, yet these models may fail for unseen poses with limited and fixed training data. This paper proposes a novel data augmentation method that: (1) is scalable for synthesizing massive amount of training data (over 8 million valid 3D human poses with corresponding 2D projections) for training 2D-to-3D networks, (2) can effectively reduce dataset bias. Our method evolves a limited dataset to synthesize unseen 3D human skeletons based on a hierarchical human representation and heuristics inspired by prior knowledge. Extensive experiments show that our approach not only achieves state-of-the-art accuracy on the largest public benchmark, but also generalizes significantly better to unseen and rare poses. Relevant files and tools are available at the project website.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🐣 Hot Topic Early Bird — monocular depth estimation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Shichao Li , Lei Ke , Kevin Pratama , Yu-Wing Tai , Chi-Keung Tang , Kwang-Ting Cheng

Topics

Machine Learning > Application Areas > Data Augmentation Deep Learning > Techniques > Pretraining Computer Vision > Analysis > Human Pose Estimation Deep Learning > Learning Types > Deep Learning Deep Learning > Learning Types > Data Augmentation

Keywords

representation learning data augmentation 3d vision monocular depth estimation human pose estimation deep learning 3d human pose estimation evolutionary algorithm monocular vision neural network

Download PDF

Related papers

Deep Polarization Cues for Transparent Object Segmentation 2020

HRank: Filter Pruning Using High-Rank Feature Map 2020

Panoptic-Based Image Synthesis 2020

Select, Supplement and Focus for RGB-D Saliency Detection 2020

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings 2020