Spatiotemporal Deformable Part Models for Action Detection

Yicong Tian; Rahul Sukthankar; Mubarak Shah

CVPR 2013

Abstract

Deformable part models have achieved impressive performance for object detection, even on difficult image datasets. This paper explores the generalization of deformable part models from 2D images to 3D spatiotemporal volumes to better study their effectiveness for action detection in video. Actions are treated as spatiotemporal patterns and a deformable part model is generated for each action from a collection of examples. For each action model, the most discriminative 3D subvolumes are automatically selected as parts and the spatiotemporal relations between their locations are learned. By focusing on the most distinctive parts of each action, our models adapt to intra-class variation and show robustness to clutter. Extensive experiments on several video datasets demonstrate the strength of spatiotemporal DPMs for classifying and localizing actions.
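The idea of scoring a detection from parts and the spatiotemporal relations between their locations can be illustrated with a small sketch. This is not the authors' implementation: it assumes the usual deformable part model score (root response plus, for each part, the best part response minus a quadratic deformation penalty) carried over to (t, y, x) volumes. The function names, the `radius` search window, and the per-axis penalty weights are all hypothetical choices made for illustration.

```python
import numpy as np


def best_part_placement(response, anchor, weights, radius=2):
    """Find the best placement of one part near its anchor.

    response: 3D array of part filter responses indexed as (t, y, x).
    anchor:   expected (t, y, x) location of the part (assumed given).
    weights:  length-3 quadratic deformation penalties per axis (assumed).
    Returns (score, location), where score = response - deformation cost.
    """
    T, H, W = response.shape
    at, ay, ax = anchor
    best_score, best_loc = -np.inf, None
    # Exhaustive search over a small window around the anchor.
    for t in range(max(0, at - radius), min(T, at + radius + 1)):
        for y in range(max(0, ay - radius), min(H, ay + radius + 1)):
            for x in range(max(0, ax - radius), min(W, ax + radius + 1)):
                d = np.array([t - at, y - ay, x - ax], dtype=float)
                score = response[t, y, x] - float(np.dot(weights, d * d))
                if score > best_score:
                    best_score, best_loc = score, (t, y, x)
    return best_score, best_loc


def detection_score(root_score, part_responses, anchors, weights, radius=2):
    """Sum the root score and the best deformation-penalised part scores."""
    total = float(root_score)
    placements = []
    for resp, anchor, w in zip(part_responses, anchors, weights):
        score, loc = best_part_placement(resp, anchor, w, radius)
        total += score
        placements.append(loc)
    return total, placements


# Toy example: one part whose strongest response is one voxel away
# from its anchor along x.
resp = np.zeros((5, 5, 5))
resp[2, 2, 3] = 1.0
total, placements = detection_score(
    root_score=1.0,
    part_responses=[resp],
    anchors=[(2, 2, 2)],
    weights=[np.array([0.1, 0.1, 0.1])],
)
```

With a small penalty the part shifts to the stronger response; with a large penalty it stays at its anchor. That trade-off is how a part-based model of this kind can tolerate intra-class variation while still constraining where parts may appear.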

Topics

Computer Vision > Analysis > 3D Vision
Computer Vision > Analysis > Action Recognition
Computer Vision > Analysis > Object Detection
Computer Vision > Analysis > Video Understanding

Keywords

object detection, video classification, video understanding, spatiotemporal modeling, deformable part model, action detection
