Abstract
The paper describes a fast system for appearance-based image recognition. It uses local invariant descriptors and efficient nearest neighbor search. First, local affine invariant regions are found nested at multiscale intensity extrema. These regions are characterized by nine generalized color moment invariants. A novel method called HPAT (hyper-polyhedron with adaptive threshold) is introduced for efficient localization of the nearest neighbor in feature space.
The invariants make the method robust against changing illumination and viewpoint. The locality helps to resolve occlusions. The proposed indexing method overcomes the drawbacks of most binary tree-like indexing techniques, namely the high complexity in high-dimensional data sets and the boundary problem. The database representation is very compact, and retrieval is close to real time on a standard PC. The performance of the proposed method is demonstrated on a public database containing 1005 images of urban scenes. Experiments with an image database containing objects are also presented.
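The abstract does not spell out how HPAT works internally, so the following is only a minimal sketch of the general idea it names: a cheap per-dimension test selects candidates inside a region around the query, and a threshold is adapted until the nearest neighbor is found. The sketch uses an axis-aligned hypercube rather than the paper's hyper-polyhedron, and the function names and the parameters `eps`, `growth`, and `max_rounds` are hypothetical, chosen for illustration.

```python
import math


def hypercube_candidates(database, query, eps):
    """Return indices of points inside the axis-aligned hypercube of
    half-width eps centred on the query (a cheap per-dimension test)."""
    return [
        i for i, point in enumerate(database)
        if all(abs(p - q) <= eps for p, q in zip(point, query))
    ]


def nearest_neighbor(database, query, eps=0.1, growth=2.0, max_rounds=20):
    """Illustrative search, not the paper's HPAT: filter with a hypercube,
    rank the survivors by Euclidean distance, and enlarge the threshold
    until the best candidate is provably the true nearest neighbor."""
    for _ in range(max_rounds):
        candidates = hypercube_candidates(database, query, eps)
        if candidates:
            best = min(candidates, key=lambda i: math.dist(database[i], query))
            # Every point outside the cube differs by more than eps in some
            # coordinate, so it is farther than eps from the query. The
            # result is therefore exact once the best distance is <= eps.
            if math.dist(database[best], query) <= eps:
                return best
        eps *= growth  # adapt the threshold and try again
    # Fallback (assumption): exhaustive search if the threshold stayed too small.
    return min(range(len(database)), key=lambda i: math.dist(database[i], query))


# Toy 2-D example; in the system described above each descriptor would
# hold nine invariants instead of two coordinates.
descriptors = [(0.0, 0.0), (1.0, 1.0), (0.2, 0.1)]
match = nearest_neighbor(descriptors, (0.25, 0.1))
```

The per-dimension test avoids computing full distances for most of the database, which is the kind of saving a threshold-based index aims for. How the actual method chooses and adapts its threshold is described in the full paper, not in this abstract.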
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shao, H., Svoboda, T., Tuytelaars, T., Van Gool, L. (2003). HPAT Indexing for Fast Object/Scene Recognition Based on Local Appearance. In: Bakker, E.M., Lew, M.S., Huang, T.S., Sebe, N., Zhou, X.S. (eds) Image and Video Retrieval. CIVR 2003. Lecture Notes in Computer Science, vol 2728. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45113-7_8
DOI: https://doi.org/10.1007/3-540-45113-7_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40634-1
Online ISBN: 978-3-540-45113-6
eBook Packages: Springer Book Archive