Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/524178.836741guideproceedingsArticle/Chapter ViewAbstractPublication PagesConference Proceedingsacm-pubtype
Article

Document Layout Structure Extraction Using Bounding Boxes of Different Entities

Published: 02 December 1996 Publication History

Abstract

This paper presents an efficient technique for document page layout structure extraction and classification by analyzing the spatial configuration of the bounding boxes of different entities on the given image. The algorithm segments an image into a list of homogeneous zones. The classification algorithm labels each zone as text, table, line-drawing, halftone, ruling, or noise. The text-lines and words are extracted within text zones and neighboring text-lines are merged to form text-blocks. The tabular structure is further decomposed into row and column items. Finally, the document layout hierarchy is produced from these extracted entities.

Cited By

View all
  • (2016)Page segmentation using minimum homogeneity algorithm and adaptive mathematical morphologyInternational Journal on Document Analysis and Recognition10.1007/s10032-016-0265-319:3(191-209)Online publication date: 1-Sep-2016
  • (2015)Hybrid page segmentation using multilevel homogeneity structureProceedings of the 9th International Conference on Ubiquitous Information Management and Communication10.1145/2701126.2701138(1-6)Online publication date: 8-Jan-2015
  • (2008)Spatial Relation Based Object Extraction from the World Wide WebProceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 0310.1109/WIIAT.2008.371(94-97)Online publication date: 9-Dec-2008
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Guide Proceedings
WACV '96: Proceedings of the 3rd IEEE Workshop on Applications of Computer Vision (WACV '96)
December 1996
ISBN:0818676205

Publisher

IEEE Computer Society

United States

Publication History

Published: 02 December 1996

Author Tags

  1. Document layout analysis
  2. bounding box projection.
  3. layout structure
  4. page segmentation

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Page segmentation using minimum homogeneity algorithm and adaptive mathematical morphologyInternational Journal on Document Analysis and Recognition10.1007/s10032-016-0265-319:3(191-209)Online publication date: 1-Sep-2016
  • (2015)Hybrid page segmentation using multilevel homogeneity structureProceedings of the 9th International Conference on Ubiquitous Information Management and Communication10.1145/2701126.2701138(1-6)Online publication date: 8-Jan-2015
  • (2008)Spatial Relation Based Object Extraction from the World Wide WebProceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 0310.1109/WIIAT.2008.371(94-97)Online publication date: 9-Dec-2008
  • (2007)A table-form extraction with artefact removalProceedings of the 2007 ACM symposium on Applied computing10.1145/1244002.1244144(622-626)Online publication date: 11-Mar-2007
  • (2006)A new table interpretation methodology with little knowledge baseProceedings of the 2006 ACM symposium on Applied computing10.1145/1141277.1141470(847-852)Online publication date: 23-Apr-2006
  • (2006)Handwritten artefact identification method for table interpretation with little use of previous knowledgeProceedings of the 7th international conference on Document Analysis Systems10.1007/11669487_16(176-185)Online publication date: 13-Feb-2006
  • (2001)An Optimization Methodology for Document Structure Extraction on Latin Character DocumentsIEEE Transactions on Pattern Analysis and Machine Intelligence10.1109/34.93584623:7(719-734)Online publication date: 1-Jul-2001

View Options

View options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media