Abstract
Multimedia databases (in particular image databases) are different from traditional system since they cannot ignore the perceptual substratum on which the data come. There are several consequences of this fact. The most relevant for our purposes is that it is no longer possible to identify a well defined meaning of an image and, therefore, matching based on meaning is impossible. Matching should be replaced by similarity assessment and, in particular, by something close to human preattentive similarity.
In this paper we propose a geometric model of similarity measurement that subsumes most of the models proposed for psychological similarity.
Similar content being viewed by others
References
F. Gregory Ashby and Nancy A. Perrin, “Toward a unified theory of similarity and recognition,” Psychological Review, Vol. 95, No. 1, pp. 124–150, 1988.
J.R. Bach, S. Paul, and R. Jain, “A visual information management system for the interactive retrieval of faces,” IEEE Transactions on Knowledge and Data Engineering, Vol. 5, No. 4, pp. 619–628, 1993.
Roger G. Barker, “On the nature of the environment,” in The Psychology of Egon Brunswik, Kenneth R. Hammond (Ed.), Holt, Reinheart and Winston, 1966.
J. Beck, “Textural segmentation,” in Organization and Representation in Perception, J. Beck (Ed.), Erlbaum, 1982.
William M. Boothby, “An introduction to differentiable manifolds and Riemannian geometry,” Pure and Applied Mathematics, Academic Press, 1975.
Vicki Bruce and Patrick Green, “Visual perception: Physiology, psychology, and ecology,” Lawrence Erlbaum Associates, 1985.
Egon Brunswik, “Perception and the representative design of psychological experiments,” University of California Press, 1956.
J. Douglas Carroll and Phipps Arabie, “Multidimensional scaling,” Annual Review of Psychology, Vol. 31, pp. 607–649, 1980.
Shih Fu Chang and John R. Smith, “Extracting multi-dimensional signal features for content-based visual query,” in SPIE Symposium on Communications and Signal Processing, 1995.
H. Chen, B. Schatz, T. Ng, J. Martinez, A. Kirchoff, and C. Lin, “A parallel computing approach to creating engineering concept spaces for semantic retrieval: The illinois digital library initiative project,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, Aug. 1996.
Daniel M. Ennis, Joseph J. Palen, and Kenneth Mullen, “A multidimensional stochastic theory of similarity,” Journal of Mathematical Psychology, Vol. 32, pp. 449–465, 1988.
Myron Flickner, Harpreet Sawhney, Wayne Niblack, Jonathan Ashley, Qian Huang, Byron Dom, Monika Gorkani, Jim Hafner, Denis Lee, Dragutin Petkovic, David Steele, and Peter Yanker, “Query by image and video content: The QBIC system,” IEEE Computer, 1995.
J.J. Gibson, The Ecological Approach to Visual Perception, Houghton Mifflin, 1979.
Alvin G. Goldstein and June Chance, “Measuring psychological similarity of faces,” Bulletin of the Psychonomic Society, Vol. 7, No. 4, pp. 407–408, 1976.
Armanath Gupta, “Visual information retrieval technology: A virage perspective,” Technical report, Virage, Inc., 1995.
J.P. Hailman, Optical Signals: Animal Communication and Light, Indiana University Press, 1977.
Kenneth R. Hammond, “Probabilistic functionalism: Egon brunswik's integration of the history, theory, and method of psychology,” in The Psychology of Egon Brunswik, Kenneth R. Hammond (Ed.), Holt, Reinheart and Winston, 1966, pp. 15–80.
Ronald Henss, “Dimensionen der ahnlichkeit von gesichtern-eine kreuzvalidierung (dimensions of similarity of faces: A cross-validation study),” Zeitschrift fur Experimentelle und Angewandte Psychologie, Vol. 41, No. 3, pp. 398–414, 1994.
Alston S. Householder and Herbert D. Landahl, Mathematical Biophysics of the Central Nervous System, Principia Press: Bloomington, Ind., 1945.
Chih-Cheng Hsu, Wesley W. Chu, and Ricky K. Taira, “A knowledge-based approach for retrieving images by content,” IEEE Transactions on Pattern Analysis and Machine Imtelligence, Vol. 8, No. 4, pp. 522–532, Aug. 1996.
F. Idris and S. Panchanathan, “Image indexing using wavelet vector quantization,” in Proceedings of the SPIE, Vol. 2606—Digital Image Storage and Archiving Systems, Philadelphia, PA, USA, 25-26 Oct., 1995, pp. 269–275.
Johannes Itten, The Art of Color, Reinhold Pub. Corp: New York, 1961.
Charles E. Jacobs, Adam Finkelstein, and Savid H. Salesin, “Fast multiresolution image querying,” in Proceedings of SIGGRAPH 95, Los Angeles, CA. ACM SIGGRAPH, New York, 1995.
Carol L. Krumhansl, “Concerning the applicability of geometric models to similarity data: The interrelationship between similarity and spatial density,” Psychological Review, Vol. 85, pp. 445–463, 1978.
Robert W. Leeper, “A critical consideration of Egon Brunswik's probabilistic functionalism,” in The Psychology of Egon Brunswik, Kenneth R. Hammond (Ed.), Holt, Reinheart and Winston, 1966.
Barry Loewer and Georges Rey, Meaning in Mind: Fodor and his Critics, Blackwell, 1991.
D. Lovelock and H. Rund, Tensors, Differential Forms, and Variational Principles, Dover Books on Advanced Mathematics, 63. Dover Publications, Inc.: New York, 1975, 1989.
Jitendra Malik and Pietro Perona, “Preattentive texture discrimination with early vision mechanisms,” Journal of the Optical Society of America A, Vol. 7, No. 5, 1990.
B.S. Manjunath and W.Y. Ma, “Texture features for browsing and retrieval of image data,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, pp. 837–842, 1996.
Kinya Maruyama, Shusei Minakawa, and Hitoshi Okada, “Face similarity and its effect on length discrimination about eyebrows,” Tohoku Psychologica Folia, Vol. 48, No. 1-4, pp. 107–113, 1989.
Carol Mason and Eric R. Kandel, “Central visual pathways,” in Principles of Neural Science, Eric R. Kandel, James H. Schwartz, and Thomas M. Jessell (Eds.), Appleton & Lange, 1991, Chapt. 30, pp. 420–439.
Mohammad Nabil, Anne H.H. Ngu, and John Sheperd, “Picture similarity retrieval using the 2D projection interval representation,” IEEE Transaction on Knowledge and Data Engineering, Vol. 8, No. 4, pp. 533–539, Aug. 1996.
T. Okubo, Differential Geometry, Monographs and Textbooks in Pure and Applied Mathematics,” Marcel Dekker, Inc.: 270 Madison Ave, New York 10016, 1987.
R.R. Olson and F. Attneave, “What variables produce similarity grouping?” Americal Journal of Psychology, Vol. 83, pp. 1–21, 1970.
A. Pentland, R.W. Picard, and S. Sclaroff, “Photobook: Tools for content-based manipulation of image databases,” in SPIE Conference on Storage and Retrieval of Images and Video Databases II, San Jose, CA, Feb. 1994, Vol. 2185.
Lewis Petrinovich, “Probabilistic functionalism: A concept of research method,” American Psychologist, Vol. 34, No. 5, pp. 373–390, May 1979.
N.K. Ratha, K. Karu, Shaoyun Chen, and A.K. Jain, “A real-time matching system for large fingerprint databases,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, pp. 799–813, 1996.
H. Samet and A. Soffer, “MARCO: MAp retrieval by content,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, pp. 783–798, 1996.
Simone Santini and Ramesh Jain, “Similarity matching,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995 (submitted).
Simone Santini and Ramesh Jain, “Similarity queries in image databases,” in Proceedings of CVPR '96, International IEEE Computer Vision and Pattern Recognition Conference, 1996.
H. Sawhney and S. Ayer, “Compact representation of videos through dominant and multiple motion estimation,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 18, No. 8, Aug. 1996.
Robert Shapley, Terrence Caelli, Stephen Grossberg, Michael Morgan, and Ingo Rentschler, “Computational theories of visual perception,” in Visual Perception: The Neurophysiological Foundation, Lothar Spillman and John S. Werner (Eds.), Academic Press, 1990, pp. 417–448.
Roger N. Shepard, “The analysis of proximities: Multidimensional scaling with unknown distance function,” Part I, Psychometrika, Vol. 27, pp. 125–140, 1962.
Roger N. Shepard, “Toward a universal law of generalization for physical science,” Science, Vol. 237, pp. 1317–1323, 1987.
Hans-Georg Stark and Gernod P. Laufkötter, “Image indexing and content based access to databases of medical images with wavelets,” in Proceedings of the SPIE, Vol. 2569—Wavelet Applications in Signal and Image Processing III, San Diego, CA, USA, 12-14 July, 1995, pp. 790–800.
Warren S. Torgerson, “Multidimensional scaling of similarity,” Psychometrika, Vol. 30, pp. 379–393, 1965.
A. Treisman, “Features and objects in visual processing,” Scientific American, Vol. 255, pp. 114B–125, 1986.
A. Treisman, “Properties, parts, and objects,” in Handbook of Perception and Human Performance, K.R. Boff, L. Kaufman, and J.P. Thomas (Eds.), Wiley, 1987.
Amos Tversky, “Features of similarity,” Psychological Review, Vol. 84, No. 4, pp. 327–352, July 1977.
Amos Tversky and David H. Krantz, “The dimensional representation and the metric structure of similarity data,” Journal of Mathematical Psychology, Vol. 7, pp. 572–597, 1970.
David C. Van Essen, H.R. Newsome, T. William, and Maunsell, “The visual field representation in striate cortex of the macaque monkey: Asymmetries, anisotropies, and individual variability,” Vision Research, Vol. 24, No. 5, pp. 429–448, 1984.
Frank W. Warner, “Foundations of differentiable manifolds and lie groups,” Graduate Texts in Mathematics, 94. Springer-Verlag, 1983.
Hugh R. Wilson, Dennis Levi, Lamberto Maffei, Jyrki Rovamo, and Russel DeValois, “The perception of form, retina to striate cortex,” in Visual Perception: The Neurophysiological Foundation, Lothar Spillman and John S. Werner (Eds.), Academic Press, 1990.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Santini, S., Jain, R. Similarity is a Geometer. Multimedia Tools and Applications 5, 277–306 (1997). https://doi.org/10.1023/A:1009651725256
Issue Date:
DOI: https://doi.org/10.1023/A:1009651725256