Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3557918.3565864acmconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
short-paper

Incorporating spatial context for post-OCR in map images

Published: 14 November 2022 Publication History

Abstract

Extracting text from historical maps using Optical Character Recognition (OCR) engines often results in partially or incorrectly recognized words due to complex map content. Previous work utilizes lexical-based approaches with linguistic context or applies language models to correct OCR results for documents. However, these post-OCR methods cannot directly consider spatial relations of map text for correction. For example, "Mississippi" and "River" constitute the place phrase "Mississippi River" (linguistic relation), and near "highway", there are likely to exist intersected "road" to enter the "highway" (spatial relation). This paper presents a novel approach that exploits the spatial arrangement of map text using a contextual language model, BART [6] for post-processing of map text from OCR. The approach first structures word-level map text into sentences based on their spatial arrangement while preserving the spatial location of words constituting a place name and corrects imperfect OCR text using neighboring information. To train BART for capturing spatial relations in map text, we automatically generate large numbers of synthetic maps to fine-tune BART with location names and their spatial context. We conduct experiments on synthetic and real-world historical maps of various map styles and scales and show that the proposed method can achieve significant improvement over the commonly used lexical approach.

References

[1]
Y-Y. Chiang, S. Leyk, and C. Knoblock. 2014. A Survey of Digital Map Processing Techniques. ACM Comput. Surv. 47, 1 (may 2014), 44 pages.
[2]
G. Chiron, A. Doucet, M. Coustaty, and J. Moreux. 2017. ICDAR2017 Competition on Post-OCR Text Correction. In 2017 ICDAR, Vol. 01. 1423--1428.
[3]
S. Clematide, L. Furrer, and M. Volk. 2016. Crowdsourcing an OCR gold standard for a German and French heritage corpus. ELRA, Portorož, Slovenia, 975--982.
[4]
Y. Hu, X. Jing, Y. Ko, and J. Rayz. 2020. Misspelling Correction with Pre-trained Contextual Language Model. In 2020 IEEE 19th ICCI*CC. 144--149.
[5]
VI. Levenshtein. 1966. Binary codes capable of correcting deletions, insertions, and reversals. In Soviet physics doklady, Vol. 10. Soviet Union, 707--710.
[6]
M. Lewis, Y. Liu, N. Goyal, M. Ghazvininejad, A. Mohamed, O. Levy, V. Stoyanov, and L. Zettlemoyer. 2020. BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. In Proceedings of the 58th Annual Meeting of the ACL. ACL, 7871--7880.
[7]
Z. Li, Y-Y. Chiang, S. Tavakkol, B. Shbita, J. Uhl, S. Leyk, and C. Knoblock. 2020. An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images. In Proceedings of the 26th ACM SIGKDD. ACM, NY, USA, 3290--3298.
[8]
TTH. Nguyen, A. Jatowt, M. Coustaty, and A. Doucet. 2021. Survey of Post-OCR Processing Approaches. 54, 6 (2021).
[9]
TTH. Nguyen, A. Jatowt, N. Nguyen, M. Coustaty, and A. Doucet. 2020. Neural Machine Translation with BERT for Post-OCR Error Detection and Correction. In Proceedings of the ACM/IEEE JCDL. ACM, New York, NY, USA, 333--336.
[10]
A. Ray, Z. Chen, B. Gafford, N. Gifford, J. Kumar, A. Lamsa, L. Niehus-Staab, J. Weinman, and E. Learned-Miller. 2018. Historical map annotations for text detection and recognition.
[11]
C. Rigaud, A. Doucet, M. Coustaty, and J. Moreux. 2019. ICDAR 2019 Competition on Post-OCR Text Correction. In 2019 ICDAR. 1588--1593.
[12]
D. Rumsey. 2002. David Rumsey map collection. Cartography Associates.
[13]
E. Soper, S. Fujimoto, and Y. Yu. 2021. BART for Post-Correction of OCR Newspaper Text. In Proceedings of W-NUT 2021. ACL, Online, 284--290.
[14]
X. Zhang, Y. Su, S. Tripathi, and Z. Tu. 2022. Text Spotting Transformers. In Proceedings of the IEEE/CVF CVPR. 9519--9528.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
GeoAI '22: Proceedings of the 5th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery
November 2022
101 pages
ISBN:9781450395328
DOI:10.1145/3557918
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 November 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. BART
  2. information retrieval
  3. neural networks
  4. post-OCR processing

Qualifiers

  • Short-paper

Funding Sources

Conference

SIGSPATIAL '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 17 of 25 submissions, 68%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 103
    Total Downloads
  • Downloads (Last 12 months)27
  • Downloads (Last 6 weeks)5
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media