Markov chain based computational visual attention model that learns from eye tracking data

Published: 01 November 2014

Abstract

Highlights

  • We use a Markov chain to model visual attention.
  • Our visual attention model is based on low-level and high-level image features.
  • We use real eye tracking data to train our visual attention model.
  • We measure the performance of attention models by comparing them with human fixations.
  • Our model is more consistent with the attentional deployment of humans.

Computational visual attention models are a topic of increasing importance in computer understanding of images. Most existing attention models are based on bottom-up computation that often does not match actual human attention. To address this problem, we propose a novel visual attention model that is learned from actual eye tracking data. We use a Markov chain to model the relationship between image features and saliency, then train a support vector regression (SVR) on real eye tracking data to predict the transition probabilities of the Markov chain. Finally, a saliency map predicting the user's attention is obtained from the stationary distribution of this chain. Our experimental evaluations on several benchmark datasets demonstrate that the results of the proposed approach are comparable with or outperform state-of-the-art models on prediction of human eye fixations and interest region detection.
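
The abstract outlines a three-step pipeline: learn edge weights of a Markov chain over image regions with an SVR trained on eye tracking data, normalize the weights into transition probabilities, and read each region's saliency off the chain's stationary distribution. Below is a minimal sketch of that pipeline, assuming scikit-learn's SVR and synthetic features and fixation targets in place of real data; the pairwise feature concatenation and the power-iteration solver are illustrative choices, not the paper's exact formulation.

```python
# Minimal sketch of the abstract's pipeline; synthetic data stands in for
# real image features and eye-tracking-derived training targets.
import numpy as np
from sklearn.svm import SVR

rng = np.random.default_rng(0)

# Assume each image region i has a feature vector x_i. The SVR is trained to
# map features of a (from, to) region pair to a transition weight, with
# targets derived from recorded human fixations (random stand-ins here).
n_regions, n_feats = 64, 8
feats = rng.normal(size=(n_regions, n_feats))

pair_feats = np.concatenate(
    [np.repeat(feats, n_regions, axis=0),   # "from" region features
     np.tile(feats, (n_regions, 1))],       # "to" region features
    axis=1)
fixation_weights = rng.random(n_regions * n_regions)  # stand-in targets

svr = SVR(kernel="rbf").fit(pair_feats, fixation_weights)

# Predict pairwise weights, then row-normalize so every row of the matrix
# is a probability distribution, i.e. a valid Markov transition matrix.
W = svr.predict(pair_feats).reshape(n_regions, n_regions)
W = np.clip(W, 1e-12, None)                 # keep weights strictly positive
P = W / W.sum(axis=1, keepdims=True)

# Saliency of each region = its mass in the stationary distribution,
# computed by power iteration: pi <- pi P until convergence.
pi = np.full(n_regions, 1.0 / n_regions)
for _ in range(1000):
    nxt = pi @ P
    if np.abs(nxt - pi).sum() < 1e-10:
        break
    pi = nxt

saliency = pi / pi.max()                    # normalized per-region saliency
print(saliency[:8])
```

Because every row of P sums to one and all entries are positive, the chain is irreducible and aperiodic, so the power iteration converges to a unique stationary distribution, which plays the role of the saliency map here.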


Cited By

  • (2021) Markov chain to analyze web usability of a university website using eye tracking data. Statistical Analysis and Data Mining, 14(4), 331-341. DOI: 10.1002/sam.11512. Online publication date: 10-May-2021.


      Published In

      Pattern Recognition Letters, Volume 49, Issue C
      November 2014
      264 pages

      Publisher

      Elsevier Science Inc.

      United States


      Author Tags

      1. Attention model
      2. Eye tracking
      3. Human fixations
      4. Markov chain
      5. Saliency

      Qualifiers

      • Research-article
