When Will You Do What? - Anticipating Temporal Occurrences of Activities

Yazan Abu Farha; Alexander Richard; Juergen Gall

2018 CVPR CVPR 2018

When Will You Do What? - Anticipating Temporal Occurrences of Activities

Abstract

Analyzing human actions in videos has gained increased attention recently. While most works focus on classifying and labeling observed video frames or anticipating the very recent future, making long-term predictions over more than just a few seconds is a task with many practical applications that has not yet been addressed. In this paper, we propose two methods to predict a considerably large amount of future actions and their durations. Both, a CNN and an RNN are trained to learn future video labels based on previously seen content. We show that our methods generate accurate predictions of the future even for long videos with a huge amount of different actions and can even deal with noisy or erroneous input information.

❓ The Questioner

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

📈 Trend Setter — Trajectory Prediction

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yazan Abu Farha , Alexander Richard , Juergen Gall

Topics

Computer Vision > Analysis > Action Recognition Computer Vision > Analysis > Activity Recognition Deep Learning > Learning Types > Deep Learning Computer Vision > Analysis > Trajectory Prediction

Keywords

video prediction action prediction convolutional neural network recurrent neural network long-term prediction activity anticipation temporal prediction action forecasting

Download PDF

Related papers

Multi-Shot Pedestrian Re-Identification via Sequential Decision Making 2018

Multi-Cue Correlation Filters for Robust Visual Tracking 2018

Pointwise Convolutional Neural Networks 2018

Learning Attentions: Residual Attentional Siamese Network for High Performance Online Visual Tracking 2018

Image Generation From Scene Graphs 2018