Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks

Yujun Cai; Liuhao Ge; Jun Liu; Jianfei Cai; Tat-Jen Cham; Junsong Yuan; Nadia Magnenat Thalmann

2019 ICCV ICCV 2019

Exploiting Spatial-Temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks

Abstract

Despite great progress in 3D pose estimation from single-view images or videos, it remains a challenging task due to the substantial depth ambiguity and severe self-occlusions. Motivated by the effectiveness of incorporating spatial dependencies and temporal consistencies to alleviate these issues, we propose a novel graph-based method to tackle the problem of 3D human body and 3D hand pose estimation from a short sequence of 2D joint detections. Particularly, domain knowledge about the human hand (body) configurations is explicitly incorporated into the graph convolutional operations to meet the specific demand of the 3D pose estimation. Furthermore, we introduce a local-to-global network architecture, which is capable of learning multi-scale features for the graph-based representations. We evaluate the proposed method on challenging benchmark datasets for both 3D hand pose estimation and 3D body pose estimation. Experimental results show that our method achieves state-of-the-art performance on both tasks.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🧭 Keyword Pioneer — spatial-temporal relationship

🐣 Hot Topic Early Bird — hand pose estimation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yujun Cai , Liuhao Ge , Jun Liu , Jianfei Cai , Tat-Jen Cham , Junsong Yuan , Nadia Magnenat Thalmann

Topics

Deep Learning > Architectures > Graph Neural Networks Computer Vision > Analysis > 3D Vision Computer Vision > Analysis > Human Pose Estimation

Keywords

hand pose estimation human pose estimation 3d pose estimation graph convolutional network spatial-temporal relationship

Download PDF

Related papers

Hierarchical Self-Attention Network for Action Localization in Videos 2019

StructureFlow: Image Inpainting via Structure-Aware Appearance Flow 2019

Overcoming Catastrophic Forgetting With Unlabeled Data in the Wild 2019

Compact Trilinear Interaction for Visual Question Answering 2019

A2J: Anchor-to-Joint Regression Network for 3D Articulated Pose Estimation From a Single Depth Image 2019