Abstract
When images are described with visual words based on vector quantization of low-level color, texture, and edge-related visual features of image regions, it is usually referred as “bag-of-visual words (BoVW)”-based presentation. Although it has proved to be effective for image representation similar to document representation in text retrieval, the hard image encoding approach based on one-to-one mapping of regions to visual words is not expressive enough to characterize the image contents with higher level semantics and prone to quantization error. Each word is considered independent of all the words in this model. However, it is found that the words are related and their similarity of occurrence in documents can reflect the underlying semantic relations between them. To consider this, a soft image representation scheme is proposed by spreading each region’s membership values through a local fuzzy membership function in a neighborhood to all the words in a codebook generated by self-organizing map (SOM). The topology preserving property of the SOM map is exploited to generate a local membership function. A systematic evaluation of retrieval results of the proposed soft representation on two different image (natural photographic and medical) collections has shown significant improvement in precision at different recall levels when compared to different low-level and “BoVW”-based feature that consider only probability of occurrence (or presence/absence) of a word.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Bezdek JC (1981) Pattern recognition with fuzzy objective function algorithms. Plenum, New York
Bezdek JC, Pal SK (1992) Fuzzy models for pattern recognition: methods that search for structures in data. IEEE Press, NY
Bezdek JC, Pal MR, Keller J, Krisnapuram R (1999) Fuzzy models and algorithms for pattern recognition and image processing. Kluwer Academic Publishers, Boston
Chang SF, Sikora T, Puri A (2001) Overview of the MPEG-7 standard. IEEE Trans Circ Syst Video Technol 11:688–695
Chang E, Kingshy G, Sychay G, Gang W (2003) CBSA: content-based soft annotation for multimodal Image retrieval using Bayes point machines. IEEE Trans Circ Syst Video Tech 13:26–38
Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):1–60
Duygulu P, Barnard K, Freitas N, Forsyth D (2002) Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In: Proc. Seventh European Conf. on Computer Vision. pp 97–112
Fukunaga K (1990) Introduction to statistical pattern recognition, 2nd edn. Academic Press, San Diego, CA
Grubinger M, Clough PD, Müller H, Deselaers T (2006) The IAPR benchmark: a new evaluation resource for visual information systems. In: International Conference on Language Resources and Evaluation, Genoa, Italy
Han J, Ma KK (2002) Fuzzy color histogram and its use in color image retrieval. IEEE Trans Image Process 11(8):944–952
Haralick RM, Shanmugam K, Dinstein I (1973) Textural features for image classification. IEEE Trans Syst Man Cybern 3:610–621
Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323
Jing F, Li M, Zhang HJ, Zhang B (2004) An efficient and effective region-based image retrieval framework. IEEE Trans Image Process 13:699–709
John PE (2002) Towards Intelligent image retrieval. Pattern Recogn 35:3–14
Kohonen T (1997) Self-organizing maps, 2nd edn. Springer, Heidelberg
Laaksonen J, Koskela M, Oja E (2002) PicSOM: self-organizing image retrieval With MPEG-7 content descriptors. IEEE Trans Neural Netw 13(4):841–853
Lim JH (2000) Explicit query formulation with visual keywords. In: Proc. eighth ACM international conference on Multimedia. pp 407–412
Liua Y, Zhang D, Lu G, Ma WY (2007) A survey of content-based image retrieval with high-level semantics. Pattern Recog 40:262–282
Mitra S, Pal SK (1994) Self-organizing neural network as a fuzzy classifier. IEEE Trans Syst Man Cybernet 24(3):385–399
Müller H, Deselaers T, Kim E, Kalpathy C, Jayashree D, Thomas M, Clough P, Hersh W (2008) Overview of the ImageCLEFmed 2007 Medical Retrieval and Annotation Tasks, 8th Workshop of the Cross-Language Evaluation Forum (CLEF 2007). In: Proceedings of LNCS, vol 5152
Pei SC, Lo YS (1998) Color image compression and limited display using self-organization Kohonen map. IEEE Trans Circ Syst Video Tech 8(2):191–205
Rahman MM, Bhattacharya P, Desai BC (2009) A unified image retrieval framework on local visual and semantic concept-based feature spaces. J Vis Commun Image Represent 20:450–462
Rui Y, Huang TS, Chang SF (1999) Image retrieval: current techniques. Promising directions and open issues. J Vis Comm and Image Rep 10:39–62
Smeulders A, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Pattern Anal Machine Intell 22:1349–1380
Vesanto J (1999) SOM-based data visualization methods. Intell Data Anal 3(2):111–126
Vogel J, Schiele B (2007) Semantic modeling of natural scenes for content-based image retrieval. Int J Comput Vis 72(2):133–157
Yang CC, Bose NK (2006) Generating fuzzy membership function with self-organizing feature map. Pattern Recog Lett 27(5):356–365
Yates RB, Neto BR (1999) Modern information retrieval. Addison Wesley
Zhu L, Zhang A, Rao A, Srihari R (2002) Theory of keyblock-based image retrieval. ACM Trans Inf Syst 20(2):224–257 (ISSN: 1046–8188)
Acknowledgments
This research is partially supported by a faculty development fund from the School of Computer, Mathematical and Natural Sciences (SCMNS), Morgan State University, Baltimore, Maryland, USA. The author would like to thank the IAPR Technical Committee TC-12 and ImageCLEFmed (Müller et al. 2008) organizers for making the databases available for the experiments.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by V. Loia.
Rights and permissions
About this article
Cite this article
Rahman, M.M. A soft image representation approach by exploiting local neighborhood structure of self-organizing map (SOM). Soft Comput 20, 2759–2769 (2016). https://doi.org/10.1007/s00500-015-1675-8
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-015-1675-8