Efficient ConvNet-Based Marker-Less Motion Capture in General Scenes With a Low Number of Cameras

Ahmed Elhayek; Edilson de Aguiar; Arjun Jain; Jonathan Tompson; Leonid Pishchulin; Micha Andriluka; Chris Bregler; Bernt Schiele; Christian Theobalt

2015 CVPR CVPR 2015

Efficient ConvNet-Based Marker-Less Motion Capture in General Scenes With a Low Number of Cameras

Abstract

We present a novel method for accurate marker-less capture of articulated skeleton motion of several subjects in general scenes, indoors and outdoors, even from input filmed with as few as two cameras. Our approach unites a discriminative image-based joint detection method with a model-based generative motion tracking algorithm through a combined pose optimization energy. The discriminative part-based pose detection method, implemented using Convolutional Networks (ConvNet), estimates unary potentials for each joint of a kinematic skeleton model. These unary potentials are used to probabilistically extract pose constraints for tracking by using weighted sampling from a pose posterior guided by the model. In the final energy, these constraints are combined with an appearance-based model-to-image similarity term. Poses can be computed very efficiently using iterative local optimization, as ConvNet detection is fast, and our formulation yields a combined pose estimation energy with analytic derivatives. In combination, this enables to track full articulated joint angles at state-of-the-art accuracy and temporal stability with a very low number of cameras.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🧭 Keyword Pioneer — multi-view pose

🐣 Hot Topic Early Bird — convolutional network

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ahmed Elhayek , Edilson de Aguiar , Arjun Jain , Jonathan Tompson , Leonid Pishchulin , Micha Andriluka , Chris Bregler , Bernt Schiele , Christian Theobalt

Topics

Computer Vision > Analysis > Human Pose Estimation Computer Vision > Analysis > Object Tracking Computer Vision > Analysis > Motion Estimation Deep Learning > Architectures > Convolutional Neural Networks

Keywords

pose estimation convolutional network skeleton tracking generative tracking model-based tracking multi-view pose marker-less motion capture articulated skeleton

Download PDF

Related papers

Long-Term Correlation Tracking 2015

Hierarchically-Constrained Optical Flow 2015

Propagated Image Filtering 2015

Web Scale Photo Hash Clustering on A Single Machine 2015

Expanding Object Detector's Horizon: Incremental Learning Framework for Object Detection in Videos 2015