Learned Region Sparsity and Diversity Also Predicts Visual Attention

Zijun Wei; Hossein Adeli; Minh Hoai Nguyen; Greg Zelinsky; Dimitris Samaras

2016 NIPS NeurIPS 2016

Learned Region Sparsity and Diversity Also Predicts Visual Attention

Abstract

Learned region sparsity has achieved state-of-the-art performance in classification tasks by exploiting and integrating a sparse set of local information into global decisions. The underlying mechanism resembles how people sample information from an image with their eye movements when making similar decisions. In this paper we incorporate the biologically plausible mechanism of Inhibition of Return into the learned region sparsity model, thereby imposing diversity on the selected regions. We investigate how these mechanisms of sparsity and diversity relate to visual attention by testing our model on three different types of visual search tasks. We report state-of-the-art results in predicting the locations of human gaze fixations, even though our model is trained only on image-level labels without object location annotations. Notably, the classification performance of the extended model remains the same as the original. This work suggests a new computational perspective on visual attention mechanisms and shows how the inclusion of attention-based mechanisms can improve computer vision techniques.

🌉 Interdisciplinary Bridge — Interdisciplinary and Machine Learning

🧭 Keyword Pioneer — inhibition of return

🐣 Hot Topic Early Bird — visual attention

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zijun Wei , Hossein Adeli , Minh Hoai Nguyen , Greg Zelinsky , Dimitris Samaras

Topics

Machine Learning > Core Methods > Representation Learning Interdisciplinary > Cognitive Science > Perception

Keywords

image classification visual attention region sparsity inhibition of return gaze fixation

Download PDF

Related papers

Bayesian Intermittent Demand Forecasting for Large Inventories 2016

Dynamic Network Surgery for Efficient DNNs 2016

Beyond Exchangeability: The Chinese Voting Process 2016

Safe and Efficient Off-Policy Reinforcement Learning 2016

Tagger: Deep Unsupervised Perceptual Grouping 2016