Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1815330.1815355acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

Query driven word retrieval in graphical documents

Published: 09 June 2010 Publication History

Abstract

In this paper, we present an approach towards the retrieval of words from graphical document images. In graphical documents, due to presence of multi-oriented characters in non-structured layout, word indexing is a challenging task. The proposed approach uses recognition results of individual components to form character pairs with the neighboring components. An indexing scheme is designed to store the spatial description of components and to access them efficiently. Given a query text word (ascii/unicode format), the character pairs present in it are searched in the document. Next the retrieved character pairs are linked sequentially to form character string. Dynamic programming is applied to find different instances of query words. A string edit distance is used here to match the query word as the objective function. Recognition of multi-scale and multi-oriented character component is done using Support Vector Machine classifier. To consider multi-oriented character strings the features used in the SVM are invariant to character orientation. Experimental results show that the method is efficient to locate a query word from multi-oriented text in graphical documents.

References

[1]
S. Adam, J. M. Ogier, C. Carlon, R. Mullot, J. Labiche, and J. Gardes. Symbol and character recognition: application to engineering drawing. IJDAR, 3(2):89--101, 2000.
[2]
R. Cao and C. Tan. Text/graphics separation in maps. In Proceedings of GREC, Canada, 2001.
[3]
A. Clavelli and D. Karatzas. Text segmentation in colour posters from the spanish civil war era. In Proceedings of ICDAR, pages 181--185, Barcelona, Spain, 2009.
[4]
M. Delalandre, T. Pridmore, E. Valveny, H. Locteau, and E. Trupin. Building synthetic graphical documents for performance evaluation. Revised Selected Papers of Workshop on GREC, LNCS, 5046:288--298, 2008.
[5]
D. Doermann. The indexing and retrieval of document images: a survey. In Computer Vision and Image Understanding, volume 70, pages 287--298, 1998.
[6]
L. A. Fletcher and R. Kasturi. A robust algorithm for text string separation from mixed text/graphics images. IEEE Transactions on PAMI, 10(6):910--918, 1988.
[7]
H. Hase, T. Shinokawa, M. Yoneda, and C. Y. Suen. Recognition of rotated characters by eigen-space. In Proceedings of ICDAR, pages 731--735, Edinburgh, Scotland, 2003.
[8]
H. Hase, M. Yoneda, T. Shinokawa, and C. Y. Suen. Alignment of free layout color texts for character recognition. In Proceedings of ICDAR, pages 932--936, Seattle, USA, 2001.
[9]
H. Luo, G. Agam, and I. Dinstein. Directional mathematical morphology approach for line thinning and extraction of character strings from maps and line drawings. In Proceedings of ICDAR, volume 1, page 257, Montreal, Canada, 1995.
[10]
S. Marinai, E. Marino, and G. Soda. Indexing and retrieval of words in old documents. In Proceedings of ICDAR, page 223, Edinburgh, Scotland, 2003.
[11]
P. P. Roy, U. Pal, J. Lladós, and M. Delalandre. Multi-oriented and multi-sized touching character segmentation using dynamic programming. In Proceedings of ICDAR, pages 11--15, Barcelona, Spain, 2009.
[12]
P. P. Roy, U. Pal, J. Lladós, and F. Kimura. Multi-oriented english text line extraction using background and foreground information. In Proceedings of DAS, pages 315--322, Nara, Japan, 2008.
[13]
A. Takasu. Document filtering for fast approximate string matching of errorneous text. In Proceedings of ICDAR, pages 916--920, Seattle, USA, 2001.
[14]
C. L. Tan and P. O. Ng. Text extraction using pyramid. Pattern Recognition, 31(1):63--72, 1998.
[15]
K. Tombre and B. Lamiroy. Graphics recognition - from re-engineering to retrieval. In Proceedings of ICDAR, pages 148--155, Edinburgh, Scotland, 2003.
[16]
K. Tombre, S. Tabbone, L. Peissier, B. Lamiroy, and P. Dosch. Text/graphics separation revisited. In Proceedings of DAS, pages 200--211, NY, USA, 2002.
[17]
Q. Xie and A. Kobayashi. A construction of pattern recognition system invariant of translation, scale-change and rotation transformation of pattern. Trans. of the Society of Instrument and Control Engineers, pages 1167--1174, 1991.

Cited By

View all
  • (2019)New Tools for the Classification and Filtering of Historical MapsISPRS International Journal of Geo-Information10.3390/ijgi81004558:10(455)Online publication date: 14-Oct-2019
  • (2014)Word Spotting in Bangla and English Graphical Documents2014 22nd International Conference on Pattern Recognition10.1109/ICPR.2014.525(3044-3049)Online publication date: Aug-2014
  • (2014)Word searching in unconstrained layout using character pair codingInternational Journal on Document Analysis and Recognition10.1007/s10032-014-0227-617:4(343-358)Online publication date: 1-Dec-2014
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN:9781605587738
DOI:10.1145/1815330
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2010

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graphical document analysis
  2. graphics recognition
  3. information retrieval
  4. word spotting

Qualifiers

  • Research-article

Funding Sources

  • Spanish projects CONSOLIDER-INGENIO 2010

Conference

DAS '10

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2019)New Tools for the Classification and Filtering of Historical MapsISPRS International Journal of Geo-Information10.3390/ijgi81004558:10(455)Online publication date: 14-Oct-2019
  • (2014)Word Spotting in Bangla and English Graphical Documents2014 22nd International Conference on Pattern Recognition10.1109/ICPR.2014.525(3044-3049)Online publication date: Aug-2014
  • (2014)Word searching in unconstrained layout using character pair codingInternational Journal on Document Analysis and Recognition10.1007/s10032-014-0227-617:4(343-358)Online publication date: 1-Dec-2014
  • (2013)A Two-Stage Approach for Word Spotting in Graphical DocumentsProceedings of the 2013 12th International Conference on Document Analysis and Recognition10.1109/ICDAR.2013.71(319-323)Online publication date: 25-Aug-2013

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media