Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1282280.1282319acmconferencesArticle/Chapter ViewAbstractPublication PagescivrConference Proceedingsconference-collections
Article

Multi-level local descriptor quantization for bag-of-visterms image representation

Published: 09 July 2007 Publication History

Abstract

In the past, quantized local descriptors have been shown to be a good base for the representation of images, that can be applied to a wide range of tasks. However, current approaches typically consider only one level of quantization to create the final image representation. In this view they somehow restrict the image description to one level of visual detail. We propose to build image representations from multi-level quantization of local interest point descriptors, automatically extracted from the images. The use of this new multi-level representation will allow for the description of fine and coarse local image detail in one framework. To evaluate the performance of our approach we perform scene image classification using a 13-class data set. We show that the use of information from multiple quantization levels increases the classification performance, which suggests that the different granularity captured by the multi-level quantization produces a more discriminant image representation. Moreover, by using a multi-level approach, the time necessary to learn the quantization models can be reduced by learning the different models in parallel.

References

[1]
A. Bosch, A. Zisserman, and X. Munoz. Scene classification via PLSA. In In Proceedings of the European Conference on Computer Vision (ECCV), Graz, Austria, May 2006.
[2]
C. J. C. Burges. A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2(2):121--167, 1998.
[3]
A. de Vries. Content and multimedia database management systems. PhD thesis, Twente University, 1999.
[4]
G. Dorko and C. Schmid. Selection of scale invariant parts for object class recognition. In Proceedings of IEEE International Conference on Computer Vision (ICCV), Nice, Oct. 2003.
[5]
L. Fei-Fei and P. Perona. A Bayesian hierarchical model for learning natural scene categories. In In Proceedings of IEEE Conference in Computer Vision and Pattern Recognition (CVPR), San Diego, Jun. 2005.
[6]
K. Grauman and T. Darrell. The pyramid match kernel: Discriminative classification with sets of image features. In Proceedings of IEEE International Conference on Computer Vision (ICCV), 2005.
[7]
F. Jurie and B. Triggs. Creating efficient codebooks for visual recognition. In Proceedings of IEEE International Conference on Computer Vision (ICCV), volume 1, pages 604--610, 2005.
[8]
S. Lazebnik, C. Schmid, and J. Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In In Proc. of IEEE CVPR, 2006.
[9]
B. Leibe, K. Mikolajczyk, and B. Schiele. Efficient clustering and matching for object class recognition. In In Proceedings of British Machine Vision Conference (BMVC), 2006.
[10]
D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2):91--110, 2004.
[11]
J. Matas, O. Chum, U. Martin, and T. Pajdla. Robust wide baseline stereo from maximally stable extremal regions. In Proceedings of the British Machine Vision Conference, Cardiff, Sep. 2002.
[12]
K. Mikolajczyk, B. Leibe, and B. Schiele. Multiple object class detection with a generative model. In Proceedings of IEEE CVPR, New York, June 2006.
[13]
K. Mikolajczyk and C. Schmid. An affine invariant interest point detector. In In Proceeding of European Conference Computer Vision, 2002.
[14]
K. Mikolajczyk and C. Schmid. A performance evaluation of local descriptors. In Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, Toronto, June 2003.
[15]
A. Oliva and A. Torralba. Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42:145--175, 2001.
[16]
S. Paek and C. S.-F. A knowledge engineering approach for image classification based on probabilistic reasoning systems. In Proceedings of IEEE International Conference on Multimedia and Expo, New York, Aug. 2000.
[17]
P. Quelhas, F. Monay, J.-M. Odobez, D. Gatica-Perez, T. Tuytelaars, and L. V. Gool. Modeling scenes with local descriptors and latent aspects. In In Proceedings of IEEE International Conference on Computer Vision (ICCV), Beijing, Oct. 2005.
[18]
J. Sivic, B. C. Russell, A. A. Efros, A. Zisserman, and W. T. Freeman. Discovering objects and their location in image collections. In Proceedings of IEEE International Conference on Computer Vision, Beijing, October 2005.
[19]
J. Sivic and A. Zisserman. Video google: A text retrieval approach to object matching in videos. In Proceedings of IEEE International Conference on Computer Vision, Nice, October 2003.
[20]
A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(12):1349--1380, 2000.
[21]
M. Szummer and R. Picard. Indoor-outdoor image classification. In IEEE International CAIVD Workshop caivd (part of ICCV'98), Bombay, January 1998.
[22]
A. Vailaya, M. Figueiredo, A. Jain, and H. Zhang. Image classification for content-based indexing. IEEE Transactions on Image Processing, 10(1):117--130, 2001.
[23]
T. Westerveld and A. de Vries. Generative probabilistic models for multimedia retrieval: query generation versus document generation. IEE Proceedings - Vision, Image and Signal Processing, 152(6):852--858, 2005.
[24]
J. Weston and C. Watkins. Multi-class support vector machines. Technical Report CSD-TR-98-04, Department of Computer Science, Royal Holloway, University of London, May 1998.
[25]
J. Willamowski, D. Arregui, G. Csurka, C. Dance, and L. Fan. Categorizing nine visual classes using local appearance descriptors. In Proceedings of LAVS Workshop, in ICPR'04, Cambridge, August 2004.

Cited By

View all
  • (2018)Learning topic of dynamic scene using belief propagation and weighted visual words approachSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-014-1384-819:1(71-84)Online publication date: 30-Dec-2018
  • (2013)Peer-to-Peer Network-Based Image RetrievalMultimedia Networking and Coding10.4018/978-1-4666-2660-7.ch013(377-399)Online publication date: 2013
  • (2013)Computer-aided colorectal tumor classification in NBI endoscopy using local featuresMedical Image Analysis10.1016/j.media.2012.08.00317:1(78-100)Online publication date: Jan-2013
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CIVR '07: Proceedings of the 6th ACM international conference on Image and video retrieval
July 2007
655 pages
ISBN:9781595937339
DOI:10.1145/1282280
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

In-Cooperation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 July 2007

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. bag-of-visterms
  2. bag-of-words
  3. images
  4. local descriptors
  5. quantization
  6. scene classification
  7. vision
  8. vocabularies

Qualifiers

  • Article

Conference

CIVR07
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)1
  • Downloads (Last 6 weeks)0
Reflects downloads up to 19 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Learning topic of dynamic scene using belief propagation and weighted visual words approachSoft Computing - A Fusion of Foundations, Methodologies and Applications10.1007/s00500-014-1384-819:1(71-84)Online publication date: 30-Dec-2018
  • (2013)Peer-to-Peer Network-Based Image RetrievalMultimedia Networking and Coding10.4018/978-1-4666-2660-7.ch013(377-399)Online publication date: 2013
  • (2013)Computer-aided colorectal tumor classification in NBI endoscopy using local featuresMedical Image Analysis10.1016/j.media.2012.08.00317:1(78-100)Online publication date: Jan-2013
  • (2011)Natural Image Classification Based on Improved Support Vector MachineApplied Mechanics and Materials10.4028/www.scientific.net/AMM.58-60.238758-60(2387-2391)Online publication date: Jun-2011
  • (2011)Fusing integrated visual vocabularies-based bag of visual words and weighted colour moments on spatial pyramid layout for natural scene image classificationSignal, Image and Video Processing10.1007/s11760-011-0266-07:4(759-775)Online publication date: 20-Oct-2011
  • (2010)Spatial pyramid local keypoints quantization for bag of visual patches image representation2010 10th International Conference on Intelligent Systems Design and Applications10.1109/ISDA.2010.5687083(1270-1274)Online publication date: Nov-2010
  • (2009)Contextual classification of image patches with latent aspect modelsJournal on Image and Video Processing10.1155/2009/6029202009(1-20)Online publication date: 1-Jan-2009
  • (2008)Adaptively Combining Local with Global Information for Natural Scenes CategorizationIEICE - Transactions on Information and Systems10.1093/ietisy/e91-d.7.2087E91-D:7(2087-2090)Online publication date: 1-Jul-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media