Learning Video Saliency from Human Gaze Using Candidate Selection

Dmitry Rudoy; Dan B. Goldman; Eli Shechtman; Lihi Zelnik-Manor

2013 CVPR CVPR 2013

Learning Video Saliency from Human Gaze Using Candidate Selection

Abstract

During recent years remarkable progress has been made in visual saliency modeling. Our interest is in video saliency. Since videos are fundamentally different from still images, they are viewed differently by human observers. For example, the time each video frame is observed is a fraction of a second, while a still image can be viewed leisurely. Therefore, video saliency estimation methods should differ substantially from image saliency methods. In this paper we propose a novel method for video saliency estimation, which is inspired by the way people watch videos. We explicitly model the continuity of the video by predicting the saliency map of a given frame, conditioned on the map from the previous frame. Furthermore, accuracy and computation speed are improved by restricting the salient locations to a carefully selected candidate set. We validate our method using two gaze-tracked video datasets and show we outperform the state-of-the-art.

🚀 Conference Pioneer — CVPR 2013

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — gaze tracking

🐣 Hot Topic Early Bird — saliency map

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Dmitry Rudoy , Dan B. Goldman , Eli Shechtman , Lihi Zelnik-Manor

Topics

Machine Learning > Core Methods > Representation Learning Computer Vision > Analysis > Video Understanding Deep Learning > Learning Types > Representation Learning Computer Vision > Analysis > Computer Vision

Keywords

visual saliency saliency map human gaze saliency estimation gaze tracking temporal continuity candidate selection video saliency

Download PDF

Related papers

Nonlinearly Constrained MRFs: Exploring the Intrinsic Dimensions of Higher-Order Cliques 2013

An Approach to Pose-Based Action Recognition 2013

Modeling Actions through State Changes 2013

A Convex Regularizer for Reducing Color Artifact in Color Image Recovery 2013

Deformable Spatial Pyramid Matching for Fast Dense Correspondences 2013