Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1743384.1743396acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Recognition of object categories using affine kernels

Published: 29 March 2010 Publication History

Abstract

With the growing image collection on the web, classifying images has become an actively explored problem. In this paper we present a novel approach to the classification of images depicting objects in a category using the odd-man-out principle of visual categorization. Specifically, we build a model of an object category by noting discriminative features that are commonly observed across the member images of the class. Appearance changes due to pose, illumination and intra-class variations are modeled using multi-scale affine kernels. The best matching affine kernel for a given query image is found as the one that has the largest overlap of discriminable features that are commonly observed across the class. We show that using the odd-man-out principle of IQ tests not only results in better feature selection but also in more robust object class categorization, in comparison to the state-of-the-art methods on large benchmark image datasets.

References

[1]
W. E. L. Grimson and T. Lozano-Perez, "Localizing overlapping parts by searching the interpretation tree", in IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 9, issue 4., July 1987, pp. 469--482.
[2]
Wolfson, H. J. & Rigoutsos, I (1997). Geometric Hashing: An Overview. IEEE Computational Science and Engineering, 4(4), 10--21.
[3]
R. Fergus, P. Perona, A. Zisserman: Object Class Recognition by Unsupervised Scale-Invariant Learning. CVPR (2) 2003: 264--271.
[4]
Wei Zhu, Tanveer Fathima Syeda-Mahmood: Image Organization and Retrieval Using a Flexible Shape Model. CAIVD 1998: 31--40.
[5]
A. Pope, D. Lowe, "Learning appearance models for object recognition," in Object Representation in Computer Vision, pp. 201--219, 1996.
[6]
S. Belongie, J. Malik, and J. Puzicha, "Shape Context: A new descriptor for shape matching and object recognition," NIPS 2000.
[7]
Nicu Sebe, Michael S. Lew: Robust Shape Matching. CIVR 2002: 17--28
[8]
Fergus, R., Fei-Fei, L., Perona, P., Zisserman, A., "Learning object categories from Google image search," Proc. ICCV, Vol. 2, Issue, 17-21 Oct. 2005 Page(s): 1816--1823.
[9]
T. Hofmann. Probabilistic latent semantic indexing. In SIGIR, 1999.
[10]
Fei-Fei, L. and Perona, P., "A Bayesian Heirarcical Model for Learning Natural Scene Categories", Proc. CVPR, 2005.
[11]
Sivic, J. and Russell, B. and Efros, A. and Zisserman, A. and Freeman, W., "Discovering object categories in image collections." Proc. Int'l Conf. Computer Vision, Beijing, 2005.
[12]
S. Lazebnik, C. Schmid, and J. Ponce. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In CVPR, volume 2, pages 2169--2178, New York, New York, 2006.
[13]
D. G. Lowe. Distinctive image features from scale-invariant keypoints. IJCV, 60(2):91--110, 2004.
[14]
K. Grauman and T. Darrell, "Pyramid match kernels: Discriminative classification with sets of image features," in Proc. ICCV, 2005.
[15]
http://www.vision.caltech.edu/Image_Datasets/Caltech101/
[16]
M. Varma and D. Ray. "Learning the discriminative power-invariance trade-off," In Proceedings of the IEEE International Conference on Computer Vision, Rio de Janeiro, Brazil, October 2007.
[17]
A Corner Detector based on Global and Local Curvature Properties, http://www.mathworks.com/matlabcentral/fileexchange/loadFile.do?objectId=7652&objectType=file
[18]
Lowe, D. G., "Distinctive Image Features from Scale-Invariant Keypoints", International Journal of Computer Vision, 60, 2, pp. 91--110, 2004.
[19]
S. Parks and H. Black, "Building Thinking Skills," Critical Thinking Books & Software, Pacific Grove, CA. 1997.

Cited By

View all
  • (2011)Content-Based retrieval in endomicroscopyProceedings of the Second MICCAI international conference on Medical Content-Based Retrieval for Clinical Decision Support10.1007/978-3-642-28460-1_2(12-23)Online publication date: 22-Sep-2011

Index Terms

  1. Recognition of object categories using affine kernels

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MIR '10: Proceedings of the international conference on Multimedia information retrieval
    March 2010
    600 pages
    ISBN:9781605588155
    DOI:10.1145/1743384
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 29 March 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. affine kernels
    2. class categorization
    3. image indexing
    4. image retrieval
    5. object categorization
    6. object recognition

    Qualifiers

    • Research-article

    Conference

    MIR '10
    Sponsor:
    MIR '10: International Conference on Multimedia Information Retrieval
    March 29 - 31, 2010
    Pennsylvania, Philadelphia, USA

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 14 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2011)Content-Based retrieval in endomicroscopyProceedings of the Second MICCAI international conference on Medical Content-Based Retrieval for Clinical Decision Support10.1007/978-3-642-28460-1_2(12-23)Online publication date: 22-Sep-2011

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media