Deep Fully-Connected Part-Based Models for Human Pose Estimation

Rodrigo de Bem; Anurag Arnab; Stuart Golodetz; Michael Sapienza; Philip Torr

2018 ACML ACML 2018

Deep Fully-Connected Part-Based Models for Human Pose Estimation

Abstract

We propose a 2D multi-level appearance representation of the human body in RGB images, spatially modelled using a fully-connected graphical model. The appearance model is based on a CNN body part detector, which uses shared features in a cascade architecture to simultaneously detect body parts with different levels of granularity. We use a fully-connected Conditional Random Field (CRF) as our spatial model, over which approximate inference is efficiently performed using the Mean-Field algorithm, implemented as a Recurrent Neural Network (RNN). The stronger visual support from body parts with different levels of granularity, along with the fully-connected pairwise spatial relations, which have their weights learnt by the model, improve the performance of the bottom-up part detector. We adopt an end-to-end training strategy to leverage the potential of both our appearance and spatial models, and achieve competitive results on the MPII and LSP datasets.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Rodrigo de Bem , Anurag Arnab , Stuart Golodetz , Michael Sapienza , Philip Torr

Topics

Machine Learning > Core Methods > Classification Deep Learning > Architectures > Neural Networks Computer Vision > Analysis > Human Pose Estimation Machine Learning > Core Methods > Graphical Models Artificial Intelligence > Core AI > Computer Vision Deep Learning > Models > Neural Networks

Keywords

feature learning human pose estimation body part detection convolutional neural network recurrent neural network conditional random field mean field inference fully-connected model mean-field algorithm

Download PDF

Related papers

Unsupervised Heterogeneous Domain Adaptation with Sparse Feature Transformation 2018

Structured Gaussian Processes with Twin Multiple Kernel Learning 2018

Discriminative Feature Representation for Person Re-identification by Batch-contrastive Loss 2018

Adversarial TableQA: Attention Supervision for Question Answering on Tables 2018

Who Are Raising Their Hands? Hand-Raiser Seeking Based on Object Detection and Pose Estimation 2018