Estimating Human Pose with Flowing Puppets

Silvia Zuffi; Javier Romero; Cordelia Schmid; Michael J. Black

2013 ICCV ICCV 2013

Estimating Human Pose with Flowing Puppets

Abstract

We address the problem of upper-body human pose estimation in uncontrolled monocular video sequences, without manual initialization. Most current methods focus on isolated video frames and often fail to correctly localize arms and hands. Inferring pose over a video sequence is advantageous because poses of people in adjacent frames exhibit properties of smooth variation due to the nature of human and camera motion. To exploit this, previous methods have used prior knowledge about distinctive actions or generic temporal priors combined with static image likelihoods to track people in motion. Here we take a different approach based on a simple observation: Information about how a person moves from frame to frame is present in the optical flow field. We develop an approach for tracking articulated motions that "links" articulated shape models of people in adjacent frames through the dense optical flow. Key to this approach is a 2D shape model of the body that we use to compute how the body moves over time. The resulting "flowing puppets" provide a way of integrating image evidence across frames to improve pose inference. We apply our method on a challenging dataset of TV video sequences and show state-of-the-art performance.

🚀 Conference Pioneer — ICCV 2013

🐣 Hot Topic Early Bird — monocular video

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Silvia Zuffi , Javier Romero , Cordelia Schmid , Michael J. Black

Topics

Computer Vision > Analysis > Human Pose Estimation Computer Vision > Processing > Video Processing Computer Vision > Processing > Video Understanding Computer Vision > Analysis > Motion Analysis

Keywords

pose estimation human pose estimation video tracking optical flow monocular video pose tracking human pose articulated motion shape model

Download PDF

Related papers

Large-Scale Multi-resolution Surface Reconstruction from RGB-D Sequences 2013

Cascaded Shape Space Pruning for Robust Facial Landmark Detection 2013

Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach 2013

Accurate and Robust 3D Facial Capture Using a Single RGBD Camera 2013

From Where and How to What We See 2013