Synthetic Map Generation to Provide Unlimited Training Data for Historical Map Text Detection

Authors:

Craig A. Knoblock

GEOAI '21: Proceedings of the 4th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery

Pages 17 - 26

https://doi.org/10.1145/3486635.3491070

Published: 08 November 2021

Abstract

Many historical map sheets are publicly available for studies that require long-term historical geographic data. The cartographic design of these maps combines map symbols and text labels. Automatically reading text labels from map images could greatly speed up map interpretation and help generate rich metadata describing the map content. Many text detection algorithms have been proposed to locate text regions in map images automatically, but most are trained on out-of-domain datasets (e.g., scene images). Training data determines the quality of machine learning models, and manually annotating text regions in map images is labor-intensive and time-consuming. On the other hand, existing geographic data sources, such as OpenStreetMap (OSM), contain machine-readable map layers, which allow us to separate out the text layer and obtain text label annotations easily. However, the cartographic styles of OSM map tiles and historical maps differ significantly. This paper proposes a method to automatically generate an unlimited amount of annotated historical map images for training text detection models. We use a style transfer model to convert contemporary map images into historical style and place text labels upon them. We show that state-of-the-art text detection models (e.g., PSENet) can benefit from the synthetic historical maps and achieve significant improvement for historical map text detection.
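
To illustrate why synthetic labels come with annotations "for free", the following minimal sketch places text labels at chosen positions and angles and records the corresponding ground-truth polygon for each one. It is not the authors' pipeline: the style transfer step is omitted, the fixed per-character width and label height are simplifying assumptions, and the names `LabelAnnotation`, `place_label`, and `synthesize_annotations` are hypothetical.

```python
import math
import random
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class LabelAnnotation:
    """Ground-truth record for one synthetic text label."""
    text: str
    polygon: List[Point]  # four corners of the rotated label box, in pixels


def place_label(text: str, center: Point, angle_deg: float,
                char_width: float = 8.0, height: float = 14.0) -> LabelAnnotation:
    """Return the rotated bounding polygon of a label centered at `center`.

    Assumption: every character is `char_width` pixels wide, so the label
    box is len(text) * char_width by height before rotation.
    """
    half_w = char_width * len(text) / 2.0
    half_h = height / 2.0
    # Corners of the unrotated box, relative to its center.
    corners = [(-half_w, -half_h), (half_w, -half_h),
               (half_w, half_h), (-half_w, half_h)]
    theta = math.radians(angle_deg)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    cx, cy = center
    # Rotate each corner about the center, then translate into image space.
    polygon = [(cx + x * cos_t - y * sin_t, cy + x * sin_t + y * cos_t)
               for x, y in corners]
    return LabelAnnotation(text, polygon)


def synthesize_annotations(place_names: List[str], tile_size: int = 512,
                           seed: int = 0) -> List[LabelAnnotation]:
    """Assign each name a random position and angle on a square tile."""
    rng = random.Random(seed)
    annotations = []
    for name in place_names:
        center = (rng.uniform(0.2, 0.8) * tile_size,
                  rng.uniform(0.2, 0.8) * tile_size)
        angle = rng.uniform(-30.0, 30.0)
        annotations.append(place_label(name, center, angle))
    return annotations


if __name__ == "__main__":
    for ann in synthesize_annotations(["Main St", "River Rd"]):
        print(ann.text, [(round(x, 1), round(y, 1)) for x, y in ann.polygon])
```

Because the generator chooses where each label goes, the polygon is known exactly and no manual annotation is needed. In a full pipeline the same polygons would be saved alongside the rendered, style-transferred image as training targets for a text detector.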

Index Terms

1. Applied computing
  1. Document management and text processing
    1. Document capture
      1. Document analysis
      2. Graphics recognition and interpretation
2. Information systems
  1. Information systems applications
    1. Digital libraries and archives

Information & Contributors

Information

Published In

GEOAI '21: Proceedings of the 4th ACM SIGSPATIAL International Workshop on AI for Geographic Knowledge Discovery

November 2021

77 pages

ISBN: 9781450391207

DOI: 10.1145/3486635

Editors:
Dalton Lunga (Oak Ridge National Laboratory, TN, USA)
Lexie Yang (Oak Ridge National Laboratory, TN, USA)
Song Gao (University of Wisconsin, Madison, WI, USA)
Bruno Martins (University of Lisbon, Portugal)
Yingjie Hu (University at Buffalo, NY, USA)
Xueqing Deng (University of California, Merced, CA, USA)
Shawn Newsam (University of California, Merced, CA, USA)

Copyright © 2021 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGSPATIAL: ACM Special Interest Group on Spatial Information

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Funding Sources

the National Endowment for the Humanities

Conference

SIGSPATIAL '21

Sponsor:

SIGSPATIAL

SIGSPATIAL '21: 29th International Conference on Advances in Geographic Information Systems

November 2 - 5, 2021

Beijing, China

Acceptance Rates

Overall Acceptance Rate 17 of 25 submissions, 68%

Bibliometrics

Total Citations: 4
Total Downloads: 178
Downloads (Last 12 months): 32
Downloads (Last 6 weeks): 8

Reflects downloads up to 16 Feb 2025

Cited By

Wu D, Ou L, Huang H, Cao Y, Lin X, Le T, Yao S, Lee T. 2024. Animated Pictorial Maps. SIGGRAPH Asia 2024 Posters, 1--3. Online publication date: 3-Dec-2024. https://dl.acm.org/doi/10.1145/3681756.3697896

Lin Y, Chiang Y, Baeza-Yates R, Bonchi F. 2024. Hyper-Local Deformable Transformers for Text Spotting on Historical Maps. Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 5387--5397. Online publication date: 25-Aug-2024. https://dl.acm.org/doi/10.1145/3637528.3671589

Kang Y, Gao S, Roth R. 2024. Artificial intelligence studies in cartography: a review and synthesis of methods, applications, and ethics. Cartography and Geographic Information Science, 1--32. Online publication date: 16-Jan-2024. https://doi.org/10.1080/15230406.2023.2295943

Lunga D, Hu Y, Newsam S, Gao S, Martins B, Yang L, Deng X. 2022. GeoAI at ACM SIGSPATIAL. SIGSPATIAL Special 13:3, 21--32. Online publication date: 23-Dec-2022. https://dl.acm.org/doi/10.1145/3578484.3578491
