Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1007/11669487_48guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval

Published: 13 February 2006 Publication History

Abstract

Camera-based document image retrieval is a task of searching document images from the database based on query images captured using digital cameras. For this task, it is required to solve the problem of “perspective distortion” of images,as well as to establish a way of matching document images efficiently. To solve these problems we have proposed a method called Locally Likely Arrangement Hashing (LLAH) which is characterized by both the use of a perspective invariant to cope with the distortion and the efficiency: LLAH only requires O(N) time where N is the number of feature points that describe the query image. In this paper, we introduce into LLAH an affine invariant instead of the perspective invariant so as to improve its adjustability. Experimental results show that the use of the affine invariant enables us to improve either the accuracy from 96.2% to 97.8%, or the retrieval time from 112 msec./query to 75 msec./query by selecting parameters of processing.

References

[1]
D. Doermann. The Indexing and Retrieval of Document Images: A Survey. Computer Vision and Image Understanding, 70, 3, pages 287-298, 1998.
[2]
J. J. Hull. Document image matching and retrieval with multiple distortion-invariant descriptors. Document Analysis Systems, pages 379-396, 1995.
[3]
D. Doermann, H. Li and O. Kia. The detection of duplicates in document image databases. Proc. ICDAR'97, pages 314-318, 1997.
[4]
D. Doermann, J. Liang and H. Li. Progress in camera-based document image analysis. Proc. ICDAR'03, pages 606-616, 2003.
[5]
P. Clark and M. Mirmehdi. Recognising text in real scenes. IJDAR, 4, pages 243-257, 2002.
[6]
S. Pollard and M. Pilu. Building cameras for capturing documents. IJDAR, 7, pages 123-137, 2005.
[7]
H. J. Wolfson and I. Rigoutsos. Geometric hashing: an overview. IEEE Computational Science & Engineering, Vol. 4, No. 4, pages 10-21, 1997.
[8]
T. Nakai, K. Kise and M. Iwamura. Hashing with Local Combinations of Feature Points and Its Application to Camera-Based Document Image Retrieval. Proc. CBDAR'05, pages 87-94, 2005.
[9]
T. Suk and J. Flusser. Point-based projective invariants. Pattern Recognition, 33, pages 251- 261, 2000.
[10]
B. Huet and E. R. Hancock. Cartographic indexing into a database of remotely sensed images. WACV96, pages 8-14, 1996.
[11]
C. A. Rothwell, A. Zisserman, D. A. Fosyth and J. L. Mundy. Using projective invariants for constant time library indexing in model based vision. Proc. BMVC, pages 62-70, 1991.

Cited By

View all
  • (2018)Learning-based word segmentation for reliable text document retrieval and augmentationProceedings of the 24th ACM Symposium on Virtual Reality Software and Technology10.1145/3281505.3281585(1-2)Online publication date: 28-Nov-2018
  • (2018)Augmented songbookMultimedia Tools and Applications10.1007/s11042-017-4991-477:11(13773-13798)Online publication date: 1-Jun-2018
  • (2017)Dynamic Projection Mapping onto Deforming Non-Rigid Surface Using Deformable Dot Cluster MarkerIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2016.259291023:3(1235-1248)Online publication date: 1-Mar-2017
  • Show More Cited By

Index Terms

  1. Use of affine invariants in locally likely arrangement hashing for camera-based document image retrieval
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Guide Proceedings
      DAS'06: Proceedings of the 7th international conference on Document Analysis Systems
      February 2006
      627 pages
      ISBN:3540321403
      • Editors:
      • Horst Bunke,
      • A. Lawrence Spitz

      Sponsors

      • Human Ware Group, Christchurch: Human Ware Group, Christchurch, New Zealand
      • DocRec Ltd.: DocRec Ltd., Atawhai, Nelson, New Zealand
      • University of Bern: University of Bern, Switzerland
      • Siemens AG
      • Hitachi Central Research Laboratory: Hitachi Central Research Laboratory, Tokyo, Japan

      Publisher

      Springer-Verlag

      Berlin, Heidelberg

      Publication History

      Published: 13 February 2006

      Qualifiers

      • Article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 01 Oct 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2018)Learning-based word segmentation for reliable text document retrieval and augmentationProceedings of the 24th ACM Symposium on Virtual Reality Software and Technology10.1145/3281505.3281585(1-2)Online publication date: 28-Nov-2018
      • (2018)Augmented songbookMultimedia Tools and Applications10.1007/s11042-017-4991-477:11(13773-13798)Online publication date: 1-Jun-2018
      • (2017)Dynamic Projection Mapping onto Deforming Non-Rigid Surface Using Deformable Dot Cluster MarkerIEEE Transactions on Visualization and Computer Graphics10.1109/TVCG.2016.259291023:3(1235-1248)Online publication date: 1-Mar-2017
      • (2016)Aerial image sequence geolocalization with road traffic as invariant featureImage and Vision Computing10.1016/j.imavis.2016.05.01452:C(218-229)Online publication date: 1-Aug-2016
      • (2015)Quantifying reading habitsProceedings of the 2015 ACM International Joint Conference on Pervasive and Ubiquitous Computing10.1145/2750858.2804278(87-96)Online publication date: 7-Sep-2015
      • (2015)The Delaunay Document Layout DescriptorProceedings of the 2015 ACM Symposium on Document Engineering10.1145/2682571.2797059(167-175)Online publication date: 8-Sep-2015
      • (2014)Robust random dot markersProceedings of the 20th ACM Symposium on Virtual Reality Software and Technology10.1145/2671015.2671022(45-54)Online publication date: 11-Nov-2014
      • (2014)3D model-based gaze estimation in natural readingProceedings of the Symposium on Eye Tracking Research and Applications10.1145/2578153.2578164(87-90)Online publication date: 26-Mar-2014
      • (2013)Annotate meProceedings of the 2013 ACM conference on Pervasive and ubiquitous computing adjunct publication10.1145/2494091.2494165(231-234)Online publication date: 8-Sep-2013
      • (2013)I know what you are readingProceedings of the 2013 International Symposium on Wearable Computers10.1145/2493988.2494354(113-116)Online publication date: 8-Sep-2013
      • Show More Cited By

      View Options

      View options

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media