Semantically-Based Human Scanpath Estimation with HMMs

Huiying Liu; Dong Xu; Qingming Huang; Wen Li; Min Xu; Stephen Lin

2013 ICCV ICCV 2013

Semantically-Based Human Scanpath Estimation with HMMs

Abstract

We present a method for estimating human scanpaths, which are sequences of gaze shifts that follow visual attention over an image. In this work, scanpaths are modeled based on three principal factors that influence human attention, namely low-level feature saliency, spatial position, and semantic content. Low-level feature saliency is formulated as transition probabilities between different image regions based on feature differences. The effect of spatial position on gaze shifts is modeled as a Levy flight with the shifts following a 2D Cauchy distribution. To account for semantic content, we propose to use a Hidden Markov Model (HMM) with a Bag-of-Visual-Words descriptor of image regions. An HMM is well-suited for this purpose in that 1) the hidden states, obtained by unsupervised learning, can represent latent semantic concepts, 2) the prior distribution of the hidden states describes visual attraction to the semantic concepts, and 3) the transition probabilities represent human gaze shift patterns. The proposed method is applied to task-driven viewing processes. Experiments and analysis performed on human eye gaze data verify the effectiveness of this method.

🚀 Conference Pioneer — ICCV 2013

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision and Interdisciplinary and Machine Learning

🧭 Keyword Pioneer — semantic content

🐣 Hot Topic Early Bird — visual attention

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Huiying Liu , Dong Xu , Qingming Huang , Wen Li , Min Xu , Stephen Lin

Topics

Artificial Intelligence > Bayesian & Probabilistic > Probabilistic Modeling Machine Learning > Learning Types > Unsupervised Learning Computer Vision > Analysis > Human Analysis Computer Vision > Analysis > Scene Understanding Interdisciplinary > Cognitive Science > Perception Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling

Keywords

gaze prediction visual attention hidden markov model gaze estimation semantic content bag of visual word

Download PDF

Related papers

Large-Scale Multi-resolution Surface Reconstruction from RGB-D Sequences 2013

Cascaded Shape Space Pruning for Robust Facial Landmark Detection 2013

Unsupervised Intrinsic Calibration from a Single Frame Using a "Plumb-Line" Approach 2013

Accurate and Robust 3D Facial Capture Using a Single RGBD Camera 2013

From Where and How to What We See 2013