Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos

Zhanning Gao; Le Wang; Qilin Zhang; Zhenxing Niu; Nanning Zheng; Gang Hua

2019 AAAI AAAI 2019

Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos

Abstract

Abstract We propose a temporal action detection by spatial segmentation framework, which simultaneously categorize actions and temporally localize action instances in untrimmed videos. The core idea is the conversion of temporal detection task into a spatial semantic segmentation task. Firstly, the video imprint representation is employed to capture the spatial/temporal interdependences within/among frames and represent them as spatial proximity in a feature space. Subsequently, the obtained imprint representation is spatially segmented by a fully convolutional network. With such segmentation labels projected back to the video space, both temporal action boundary localization and per-frame spatial annotation can be obtained simultaneously. The proposed framework is robust to variable lengths of untrimmed videos, due to the underlying fixed-size imprint representations. The efficacy of the framework is validated in two public action detection datasets.

🚀 Conference Pioneer — AAAI 2019

🐣 Hot Topic Early Bird — video processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Zhanning Gao , Le Wang , Qilin Zhang , Zhenxing Niu , Nanning Zheng , Gang Hua

Topics

Computer Vision > Analysis > Action Recognition Computer Vision > Processing > Video Processing Computer Vision > Processing > Semantic Segmentation Computer Vision > Analysis > Video Understanding

Keywords

semantic segmentation action localization video processing untrimmed video video representation fully convolutional network temporal action detection video imprint

Download PDF

Related papers

Cooperative Multimodal Approach to Depression Detection in Twitter 2019

Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks 2019

Community Detection in Social Networks Considering Topic Correlations 2019

Session-Based Recommendation with Graph Neural Networks 2019

Blameworthiness in Multi-Agent Settings 2019