Article

Sampling strategies for bag-of-features image classification

Authors:

Frédéric Jurie,

Bill TriggsAuthors Info & Claims

ECCV'06: Proceedings of the 9th European conference on Computer Vision - Volume Part IV

Pages 490 - 503

https://doi.org/10.1007/11744085_38

Published: 07 May 2006 Publication History

Abstract

Bag-of-features representations have recently become popular for content based image classification owing to their simplicity and good performance. They evolved from texton methods in texture analysis. The basic idea is to treat images as loose collections of independent patches, sampling a representative set of patches from the image, evaluating a visual descriptor vector for each patch independently, and using the resulting distribution of samples in descriptor space as a characterization of the image. The four main implementation choices are thus how to sample patches, how to describe them, how to characterize the resulting distributions and how to classify images based on the result. We concentrate on the first issue, showing experimentally that for a representative selection of commonly used test databases and for moderate to large numbers of samples, random sampling gives equal or better classifiers than the sophisticated multiscale interest operators that are in common use. Although interest operators work well for small numbers of samples, the single most important factor governing performance is the number of patches sampled from the test image and ultimately interest operators can not provide enough patches to compete. We also study the influence of other factors including codebook size and creation method, histogram normalization method and minimum scale for feature extraction.

References

[1]

Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: ECCV'04 workshop on Statistical Learning in Computer Vision. (2004) 59-74.

[2]

Fergus, R., Perona, P., Zisserman, A.: Object class recognition by unsupervised scale-invariant learning. In: CVPR03. (2003) II: 264-271.

[3]

Leung, T., Malik, J.: Representing and recognizing the visual appearance of materials using three-dimensional textons. IJCV 43 (2001) 29-44.

Digital Library

[4]

Agarwal, S., Awan, A., Roth, D.: Learning to detect objects in images via a sparse, part-based representation. PAMI 26 (2004) 1475-1490.

Digital Library

[5]

Fergus, R., Fei-Fei, L., Perona, P., Zisserman, A.: Learning object categories from google's image search. In: ICCV. (2005) II: 1816-1823.

Digital Library

[6]

Grauman, K., Darrell, T.: Efficient image matching with distributions of local invariant features. In: CVPR05. (2005) II: 627-634.

Digital Library

[7]

Leibe, B., Schiele, B.: Interleaved object categorization and segmentation. In: BMVC. (2003).

[8]

Lowe, D.: Distinctive image features from scale-invariant keypoints. IJCV 60 (2004) 91-110.

Digital Library

[9]

Mikolajczyk, K., Schmid, C.: An affine invariant interest point detector. In: ECCV. (2002) I: 128.

Digital Library

[10]

Sivic, J., Zisserman, A.: Video google: A text retrieval approach to object matching in videos. In: ICCV03. (2003) 1470-1477.

Digital Library

[11]

Weber, M., Welling, M., Perona, P.: Unsupervised learning of models for recognition. In: ECCV. (2000) I: 18-32.

Digital Library

[12]

Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: ICCV. (2005).

Digital Library

[13]

Winn, J., Criminisi, A., Minka, T.: Object categorization by learned universal visual dictionary. In: ICCV. (2005).

Digital Library

[14]

Bouchard, G., Triggs, B.: Hierarchical part-based visual object categorization. In: CVPR. Volume 1. (2005) 710-715.

Digital Library

[15]

Agarwal, A., Triggs, B.: Hyperfeatures - multilevel local coding for visual recognition. In: ECCV. (2006).

Digital Library

[16]

Joachims, T.: Text categorization with support vector machines: learning with many relevant features. In: ECML-98, 10th European Conference on Machine Learning, Springer Verlag (1998) 137-142.

Digital Library

[17]

Niblack, W., Barber, R., Equitz, W., Flickner, M., Glasman, D., Petkovic, D., Yanker, P.: The qbic project: Querying image by content using color, texture, and shape. SPIE 1908 (1993) 173-187.

[18]

Lazebnik, S., Schmid, C., Ponce, J.: Affine-invariant local descriptors and neighborhood statistics for texture recognition. In: ICCV. (2003) 649-655.

Digital Library

[19]

Rubner, Y., Tomasi, C., Guibas, L.: The earth mover's distance as a metric for image retrieval. IJCV 40 (2000) 99-121.

Digital Library

[20]

Mikolajczyk, K., Tuytelaars, T., Schmid, C., Zisserman, A., Matas, J., Schaffalitzky, F., Kadir, T., Gool, L.V.: A comparison of affine region detectors. Int. J. Computer Vision 65 (2005) 43-72.

Digital Library

[21]

Lindeberg, T.: Detecting salient blob-like image structures and their scales with a scale-space primal sketch: A method for focus-of-attention. IJCV 11 (1993) 283-318.

Digital Library

[22]

Nowak, E., Jurie, F.: Vehicle categorization: Parts for speed and accuracy. In: VS-PETS workshop, in conjuction with ICCV 05. (2005).

[23]

et al., M.E.: The 2005 pascal visual object classes challenge. In Springer-Verlag, ed.: First PASCAL Challenges Workshop. (2006).

[24]

Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classifcation of texture and object categories: An in-depth study. Technical Report RR-5737, INRIA Rhône-Alpes, 665 avenue de l'Europe, 38330 Montbonnot, France (2005).

Cited By

Velayudhan DHassan TDamiani EWerghi N(2022)Recent Advances in Baggage Threat Detection: A Comprehensive and Systematic SurveyACM Computing Surveys10.1145/354993255:8(1-38)Online publication date: 23-Dec-2022
https://dl.acm.org/doi/10.1145/3549932
Guo JLi YWu HWang J(2021)Innovative chest X-ray image recognition technique and its economic valuePersonal and Ubiquitous Computing10.1007/s00779-021-01627-z27:4(1551-1559)Online publication date: 19-Aug-2021
https://dl.acm.org/doi/10.1007/s00779-021-01627-z
Govender DTapamo J(2020)Factors Affecting the Cost to Accuracy Balance for Real-Time Video-Based Action RecognitionComputational Science and Its Applications – ICCSA 202010.1007/978-3-030-58799-4_58(807-818)Online publication date: 1-Jul-2020
https://dl.acm.org/doi/10.1007/978-3-030-58799-4_58
Show More Cited By

Sampling strategies for bag-of-features image classification
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
  2. Machine learning

Recommendations

A New Bag of Words LBP BoWL Descriptor for Scene Image Classification
CAIP 2013: Proceedings, Part I, of the 15th International Conference on Computer Analysis of Images and Patterns - Volume 8047

This paper explores a new Local Binary Patterns LBP based image descriptor that makes use of the bag-of-words model to significantly improve classification performance for scene images. Specifically, first, a novel multi-neighborhood LBP is introduced ...
Bag of Hierarchical Co-occurrence Features for Image Classification
ICPR '10: Proceedings of the 2010 20th International Conference on Pattern Recognition

We propose a bag-of-hierarchical-co-occurrence features method incorporating hierarchical structures for image classification. Local co-occurrences of visual words effectively characterize the spatial alignment of objects’ components. The visual words ...
Region-Based Spatial Sampling for Image Classification
ICIG '13: Proceedings of the 2013 Seventh International Conference on Image and Graphics

Local descriptors with Bag-of-Words representation were widely used for image classification. Especially, local descriptors of dense spatial sampling were demonstrated to be able to further improve performances of image classification. However, denser ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings

ECCV'06: Proceedings of the 9th European conference on Computer Vision - Volume Part IV

May 2006

613 pages

ISBN:3540338381

Editors:
Aleš Leonardis
University of Ljubljana, Slovenia
,
Horst Bischof
Institute for Computer Graphics and Vision, TU Graz, Inffeldgasse 16, Graz, Austria
,
Axel Pinz
Vision-based Measurement Group, Inst. of El. Measurement and Meas. Sign. Proc. Graz, University of Technology, Inffeldgasse 16, Graz, Austria

Sponsors

University of Ljubljana: University of Ljubljana
Graz University of Technology: Graz University of Technology
Advanced Computer Vision: Advanced Computer Vision

Publisher

Springer-Verlag

Berlin, Heidelberg

Publication History

Published: 07 May 2006

Qualifiers

Article

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

149
Total Citations
View Citations
0
Total Downloads

Downloads (Last 12 months)0
Downloads (Last 6 weeks)0

Reflects downloads up to 24 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Velayudhan DHassan TDamiani EWerghi N(2022)Recent Advances in Baggage Threat Detection: A Comprehensive and Systematic SurveyACM Computing Surveys10.1145/354993255:8(1-38)Online publication date: 23-Dec-2022
https://dl.acm.org/doi/10.1145/3549932
Guo JLi YWu HWang J(2021)Innovative chest X-ray image recognition technique and its economic valuePersonal and Ubiquitous Computing10.1007/s00779-021-01627-z27:4(1551-1559)Online publication date: 19-Aug-2021
https://dl.acm.org/doi/10.1007/s00779-021-01627-z
Govender DTapamo J(2020)Factors Affecting the Cost to Accuracy Balance for Real-Time Video-Based Action RecognitionComputational Science and Its Applications – ICCSA 202010.1007/978-3-030-58799-4_58(807-818)Online publication date: 1-Jul-2020
https://dl.acm.org/doi/10.1007/978-3-030-58799-4_58
Mathuria AHammerstrom DPotok TSchuman C(2019)Approximate Pattern Matching using Hierarchical Graph Construction and Sparse Distributed RepresentationProceedings of the International Conference on Neuromorphic Systems10.1145/3354265.3354286(1-10)Online publication date: 23-Jul-2019
https://dl.acm.org/doi/10.1145/3354265.3354286
Yang RWang HDou W(2019)Acoustic event detection with two-stage judgement in the noisy environmentProceedings of the ACM Turing Celebration Conference - China10.1145/3321408.3326655(1-7)Online publication date: 17-May-2019
https://dl.acm.org/doi/10.1145/3321408.3326655
Nanni LBrahnam SGhidoni SLumini A(2019)Bioimage Classification with Handcrafted and Learned FeaturesIEEE/ACM Transactions on Computational Biology and Bioinformatics10.1109/TCBB.2018.282112716:3(874-885)Online publication date: 1-May-2019
https://dl.acm.org/doi/10.1109/TCBB.2018.2821127
Ghodrati HLuciano LHamza A(2019)Convolutional Shape-Aware Representation for 3D Object ClassificationNeural Processing Letters10.1007/s11063-018-9858-949:2(797-817)Online publication date: 1-Apr-2019
https://dl.acm.org/doi/10.1007/s11063-018-9858-9
Sert MBoyacı E(2019)Sketch recognition using transfer learningMultimedia Tools and Applications10.1007/s11042-018-7067-178:12(17095-17112)Online publication date: 1-Jun-2019
https://dl.acm.org/doi/10.1007/s11042-018-7067-1
Srivastava DBakthula RAgarwal S(2019)Image classification using SURF and bag of LBP features constructed by clustering with fixed centersMultimedia Tools and Applications10.1007/s11042-018-6793-878:11(14129-14153)Online publication date: 1-Jun-2019
https://dl.acm.org/doi/10.1007/s11042-018-6793-8
Liu YYu DChen XLi ZFan J(2019)TOP-SIFTThe Visual Computer: International Journal of Computer Graphics10.1007/s00371-018-1502-135:5(667-677)Online publication date: 1-May-2019
https://dl.acm.org/doi/10.1007/s00371-018-1502-1
Show More Cited By

View Options

View options

Media

Figures

Other

Tables

View Table of Contents