How Much Time Do You Have? Modeling Multi-Duration Saliency

Camilo Fosco; Anelise Newman; Pat Sukhum; Yun Bin Zhang; Nanxuan Zhao; Aude Oliva; Zoya Bylinskii

2020 CVPR CVPR 2020

How Much Time Do You Have? Modeling Multi-Duration Saliency

Abstract

What jumps out in a single glance of an image is different than what you might notice after closer inspection. Yet conventional models of visual saliency produce predictions at an arbitrary, fixed viewing duration, offering a limited view of the rich interactions between image content and gaze location. In this paper we propose to capture gaze as a series of snapshots, by generating population-level saliency heatmaps for multiple viewing durations. We collect the CodeCharts1K dataset, which contains multiple distinct heatmaps per image corresponding to 0.5, 3, and 5 seconds of free-viewing. We develop an LSTM-based model of saliency that simultaneously trains on data from multiple viewing durations. Our Multi-Duration Saliency Excited Model (MD-SEM) achieves competitive performance on the LSUN 2017 Challenge with 57% fewer parameters than comparable architectures. It is the first model that produces heatmaps at multiple viewing durations, enabling applications where multi-duration saliency can be used to prioritize visual content to keep, transmit, and render.

❓ The Questioner

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

🧭 Keyword Pioneer — multi-duration learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Camilo Fosco , Anelise Newman , Pat Sukhum , Yun Bin Zhang , Nanxuan Zhao , Aude Oliva , Zoya Bylinskii

Topics

Machine Learning > Core Methods > Regression Deep Learning > Architectures > Neural Networks Machine Learning > Learning Types > Multi-Task Learning Deep Learning > Learning Types > Representation Learning Computer Vision > Analysis > Computer Vision Deep Learning > Architectures > Recurrent Neural Networks

Keywords

visual saliency long short-term memory saliency prediction heatmap prediction multi-duration learning multi-duration saliency

Download PDF

Related papers

Deep Polarization Cues for Transparent Object Segmentation 2020

HRank: Filter Pruning Using High-Rank Feature Map 2020

Panoptic-Based Image Synthesis 2020

Select, Supplement and Focus for RGB-D Saliency Detection 2020

ClusterVO: Clustering Moving Instances and Estimating Visual Odometry for Self and Surroundings 2020