DOI: 10.1145/2903220.2903246

Complex layout analysis based on contour classification and morphological operations

Published: 18 May 2016

Abstract

This paper presents a technique for document image layout analysis that is suited to the colored, complex layouts of newspapers and journals. It is a hybrid technique that combines contour classification with morphological operators. Detailed experiments on 2000 scanned newspaper images gave an accuracy of more than 95%, while the computational cost per page is less than half a second.
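
The abstract gives no implementation details, so the following is only a minimal sketch of how a hybrid pipeline of this general kind might look, not the authors' method. It assumes a binarized page stored as a list of rows of 0/1 values, uses a horizontal dilation to merge neighboring glyphs into line-like regions, and uses connected-component labeling as a simple stand-in for contour tracing. The function names, the structuring-element size, and the `max_text_height` threshold are all hypothetical choices made for illustration.

```python
from collections import deque


def dilate(img, kw, kh):
    """Binary dilation with a (2*kh+1) x (2*kw+1) rectangular element."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            if img[y][x]:
                # Paint the whole rectangle centered on each foreground pixel.
                for yy in range(max(0, y - kh), min(h, y + kh + 1)):
                    for xx in range(max(0, x - kw), min(w, x + kw + 1)):
                        out[yy][xx] = 1
    return out


def components(img):
    """Bounding boxes (x0, y0, x1, y1) of 8-connected foreground regions."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if img[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue = deque([(y, x)])
                x0 = x1 = x
                y0 = y1 = y
                while queue:  # breadth-first flood fill of one region
                    cy, cx = queue.popleft()
                    x0, x1 = min(x0, cx), max(x1, cx)
                    y0, y1 = min(y0, cy), max(y1, cy)
                    for dy in (-1, 0, 1):
                        for dx in (-1, 0, 1):
                            ny, nx = cy + dy, cx + dx
                            if (0 <= ny < h and 0 <= nx < w
                                    and img[ny][nx] and not seen[ny][nx]):
                                seen[ny][nx] = True
                                queue.append((ny, nx))
                boxes.append((x0, y0, x1, y1))
    return boxes


def classify(box, max_text_height=3):
    """Hypothetical rule: short regions are text lines, tall ones are not."""
    _, y0, _, y1 = box
    return "text" if (y1 - y0 + 1) <= max_text_height else "non-text"


def analyze(img, kw=2, kh=0, max_text_height=3):
    """Merge glyphs by dilation, then label and classify each region."""
    merged = dilate(img, kw, kh)
    return [(box, classify(box, max_text_height)) for box in components(merged)]
```

On a real scan the kernel size and the height threshold would need tuning to the image resolution, and a practical pipeline would more likely rely on an image-processing library than on pure Python loops.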


Published In

SETN '16: Proceedings of the 9th Hellenic Conference on Artificial Intelligence
May 2016
249 pages

In-Cooperation

  • EETN: Hellenic Artificial Intelligence Society

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Contour classification
  2. Document images
  3. Morphological operators
  4. Page Layout Analysis

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Conference

SETN '16
