Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation

Lubor Ladicky; Philip H.S. Torr; Andrew Zisserman

2013 CVPR CVPR 2013

Human Pose Estimation Using a Joint Pixel-wise and Part-wise Formulation

Abstract

Our goal is to detect humans and estimate their 2D pose in single images. In particular, handling cases of partial visibility where some limbs may be occluded or one person is partially occluding another. Two standard, but disparate, approaches have developed in the field: the first is the part based approach for layout type problems, involving optimising an articulated pictorial structure; the second is the pixel based approach for image labelling involving optimising a random field graph defined on the image. Our novel contribution is a formulation for pose estimation which combines these two models in a principled way in one optimisation problem and thereby inherits the advantages of both of them. Inference on this joint model finds the set of instances of persons in an image, the location of their joints, and a pixel-wise body part labelling. We achieve near or state of the art results on standard human pose data sets, and demonstrate the correct estimation for cases of self-occlusion, person overlap and image truncation.

🚀 Conference Pioneer — CVPR 2013

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — articulated structure

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Lubor Ladicky , Philip H.S. Torr , Andrew Zisserman

Topics

Computer Vision > Analysis > Human Pose Estimation Computer Vision > Analysis > Object Segmentation Machine Learning > Core Methods > Structured Prediction Deep Learning > Learning Types > Structured Prediction

Keywords

semantic segmentation human pose estimation occlusion handling graphical model part-based model pixel-wise labeling pictorial structure articulated structure

Download PDF

Related papers

Nonlinearly Constrained MRFs: Exploring the Intrinsic Dimensions of Higher-Order Cliques 2013

An Approach to Pose-Based Action Recognition 2013

Modeling Actions through State Changes 2013

A Convex Regularizer for Reducing Color Artifact in Color Image Recovery 2013

Deformable Spatial Pyramid Matching for Fast Dense Correspondences 2013