Abstract
In this paper we describe the newly created eye tracking annotated database Eye-Tracking Movie Database ETMD and give some preliminary experimental results on this dataset using our new visual saliency frontend. We have developed a database with eye-tracking human annotation that comprises video clips from Hollywood movies, which are longer in duration than the existing databases’ videos and include more complex semantics. Our proposed visual saliency frontend is based on both low-level features, such as intensity, color and spatio-temporal energy, and face detection results and provides a single saliency volume map. The described new eye-tracking database can become useful in many applications while our computational frontend shows to be promising as it gave good results on predicting the eye’s fixation according to certain metrics.
Chapter PDF
Similar content being viewed by others
Keywords
References
Borji, A., Itti, L.: State-of-the-art in visual attention modeling. IEEE Trans. Pattern Analysis and Machine Intelligence 35(1), 185–207 (2013)
Evangelopoulos, G., Zlatintsi, A., Potamianos, A., Maragos, P., Rapantzikos, K., Skoumas, G., Avrithis, Y.: Multimodal saliency and fusion for movie summarization based on aural, visual, textual attention. IEEE Trans. on Multimedia 15(7) (2013)
Treisman, A., Gelade, G.: A feature integration theory of attention. Cognit. Psychology 12(1), 97–136 (1980)
Koch, C., Ullman, S.: Shifts in selective visual attention: towards the underlying neural circuitry. Human Neurobiology 4(4), 219–227 (1985)
Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Analysis and Machine Intelligence 20(11), 1254–1259 (1998)
Borji, A., Sihite, D.N., Itti, L.: Quantitative analysis of human-model agreement in visual saliency modeling: A comparative study. IEEE Trans. Image Processing 22(1), 55–69 (2013)
Poynton, C.: Digital Video and HD: Algorithms and Interfaces, 2nd edn. Morgan Kaufmann (2012)
Adelson, E.H., Bergen, J.R.: Spatiotemporal energy models for the perception of motion. J. Opt. Soc. Amer. A 2(2), 284–299 (1985)
Heeger, D.J.: Model for the extraction of image flow. J. Opt. Soc. Amer. 4(8), 1455–1471 (1987)
Bovik, A.C., Gopal, N., Emmoth, T., Restrepo, A.: Localized Measurement of Emergent Image Frequencies by Gabor Wavelets. IEEE Trans. Information Theory 38, 691–712 (1992)
Havlicek, J.P., Harding, D.S., Bovik, A.C.: Multidimensional quasi-eigenfunction approximations and multicomponent am-fm models. IEEE Trans. Image Processing 9(2), 227–242 (2000)
Viola, P., Jones, M.J.: Robust real-time face detection. Int’l. J. Comput. Vis. 57(2), 137–154 (2004)
Wyszecki, G., Stiles, W.S.: Color Science, 2nd edn. J. Wiley & Sons, NY (1982)
Gabor, D.: Theory of Communication. IEE Journal (London) 93, 429–457 (1946)
Daugman, J.: Uncertainty Relation for Resolution in Space, Spatial Frequency and Orientation Optimized by Two-Dimensional Visual Cortical Filters. J. Opt. Soc. Amer. A 2(7), 1160–1169 (1985)
Daugman, J.G.: Two-dimensional spectral analysis of cortical receptive field profiles. Vision Research 20(10), 847–856 (1980)
Green, D.M., Swets, J.A.: Signal detection theory and psychophysics. Wiley, New York (1966)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Koutras, P., Katsamanis, A., Maragos, P. (2014). Predicting Eyes’ Fixations in Movie Videos: Visual Saliency Experiments on a New Eye-Tracking Database. In: Harris, D. (eds) Engineering Psychology and Cognitive Ergonomics. EPCE 2014. Lecture Notes in Computer Science(), vol 8532. Springer, Cham. https://doi.org/10.1007/978-3-319-07515-0_19
Download citation
DOI: https://doi.org/10.1007/978-3-319-07515-0_19
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07514-3
Online ISBN: 978-3-319-07515-0
eBook Packages: Computer ScienceComputer Science (R0)