Abstract
Geoscience knowledge graph has become a popular topic in recent years. A series of studies have been reported to introduce the construction and application of geoscience knowledge graphs from different views. The relational geoscience dataset with high knowledge density and data quality is an important digital heritage of geoscience. The relational dataset has not been taken seriously in the geoscience knowledge graph research. In this study, we proposed a quick method of building a geoscience knowledge graph using relational data mapping to triples. First, the use-case-driven method was applied to design the ontology of porphyry copper deposits. Second, the mapping rules were built based on the porphyry copper ontology. Third, the knowledge graph of the porphyry copper deposit was constructed based on relational data mapping and knowledge fusion. Based on the resulting knowledge graph, several exploratory cases were conducted to make knowledge reasoning and discovery. It is indicated that the solution proposed in this study is a fast batch-processing geoscience knowledge graph construction method. The experiences from this study can benefit the construction of knowledge graphs in other geoscience disciplines and promote knowledge discovery.
Similar content being viewed by others
Data availability
The dataset of porphyry copper deposits can be obtained from US Geological Survey (https://mrdata.usgs.gov/porcu/). The file of mapping rules is available on GitHub at https://github.com/wangcug/DataMapping.
Code availability
The mapping rules file is available on GitHub at https://github.com/wangcug/DataMapping.
References
Bergen KJ, Johnson PA, de Hoop MV, Beroza GC (2019) Machine learning for data-driven discovery in solid Earth geoscience. Science 363(6433):eaau0323. https://doi.org/10.1126/science.aau0323
Cerans K, Būmans G (2015) RDB2OWL: A Language and Tool for Database to Ontology Mapping. (Paper presented at the CAISE 2015 Forum)
Chen Q, Yao H, Li S, Li X, Kang X, Lai W, Kuang J (2023) Fact-condition statements and super relation extraction for geothermic knowledge graphs construction. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101412
Chhaya P, Lee K-H, Shin K-s, Choi C-H, Cho W-S, Lee Y-S (2016) ‘Using D2RQ and Ontop to publish relational database as Linked Data’ 2016 Eighth International Conference on Ubiquitous and Future Networks (ICUFN). IEEE, pp. 694–698
Cox SJ, Richard S (2015) A geologic timescale ontology and service. Earth Sci Inf 8:5–19
Devi R, Singh R, Singh VP (2018a) Comparative study of RDB to RDF Mapping using D2RQ and R2RML mapping languages. Int J Inform Sci Application 10(1):23–36
Devi R, Singh R, Singh VP (2018b) Comparative study of RDB to RDF Mapping using D2RQ and R2RML mapping languages. Int J Inform Sci Application 10(1):23–26
Enkhsaikhan M (2021) Geological knowledge graph construction from Mineral Exploration text. The University of Western Australia
Fan R, Wang L, Yan J, Song W, Zhu Y, Chen X (2019) Deep learning-based named Entity Recognition and Knowledge Graph Construction for Geological hazards. ISPRS Int J Geo-Information 9(1). https://doi.org/10.3390/ijgi9010015
Fensel D, Şimşek U, Angele K, Huaman E, Kärle E, Panasiuk O et al (2020) Introduction: what is a knowledge graph? Knowl Graphs 1–10. https://doi.org/10.1007/978-3-030-37439-6_1
Gil Y, Pierce SA, Babaie H, Banerjee A, Borne K, Bust G et al (2018) Intelligent systems for geosciences. Commun ACM 62(1):76–84. https://doi.org/10.1145/3192335
Hu X, Ma X, Ma C et al (2023a) The geoscience knowledge system, ontology and knowledge graph for data-driven discovery: Preface. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2023.101592
Hu X, Xu Y, Ma X, Yunqiang Z, Chao M, Chao L et al (2023b) Knowledge System, Ontology, and knowledge graph of the Deep-Time Digital Earth (DDE): Progress and Perspective. J Earth Sci 34(5):1323–1327. https://doi.org/10.1007/s12583-023-1930-1
Husson J, Peters S, Ross I, Czaplewski J (2016) (2016) Macrostrat and GeoDeepDive: A Platform for Geological Data Integration and Deep-Time Research, AGU Fall Meeting Abstracts. pp. IN23F-04
Jaccard P (1912) The distribution of the flora in the alpine zone. New Phytol 11(2):37–50
Koskela R, Ramamurthy M, Pearlman J, Lehnert K, Ahern T, Fredericks J et al (2017) Earthcube: A community-driven cyberinfrastructure for the geosciences, EGU General Assembly Conference Abstracts. p. 5884
Kumar Gond A, Dey S, Zong K, Liu Y, Anand R, Mitra A, Mitra A (2023) A better understanding of Archean crustal evolution: exploring the sedimentary archive of the Singhbhum Craton, eastern India. J Asian Earth Sci 251. https://doi.org/10.1016/j.jseaes.2023.105630
Li S, Chen J, Liu C, Wang Y (2021) Mineral Prospectivity Prediction via Convolutional neural networks based on geological Big Data. J Earth Sci 32(2):327–347. https://doi.org/10.1007/s12583-020-1365-z
Lv X, Xie Z, Xu D, Jin X, Ma K, Tao L et al (2022) Chinese Named Entity Recognition in the Geoscience Domain based on BERT. Earth Space Sci 9(3). https://doi.org/10.1029/2021ea002166
Ma X (2022) Knowledge graph construction and application in geosciences: a review. Comput Geosci 161:105082. https://doi.org/10.1016/j.cageo.2022.105082
Ma X, Ma C, Wang C (2020) A new structure for representing and tracking version information in a deep time knowledge graph. Comput Geosci 145:104620
Ma C, Morrison SM, Muscente AD, Wang C, Ma X (2022) Incorporate temporal topology in a deep-time knowledge base to facilitate data‐driven discovery in geoscience. Geosci Data J. https://doi.org/10.1002/gdj3.171
Ma C, Kale AS, Zhang J, Ma X (2023) A knowledge graph and service for regional geologic time standards. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101453
Michel F, Montagnat J, Zucker CF (2013) ‘A survey of RDB to RDF translation approaches and tools’. https://hal.archives-ouvertes.fr/hal-00903568v1
Normile D (2019) Earth scientists plan a ‘geological Google’. Science 363(6430):917. https://doi.org/10.1126/science.363.6430.917
Parsons MA, Duerr R, Godøy Ø (2023) The evolution of a geoscience standard: an instructive tale of science keyword development and adoption. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101400
Peters SE, Husson JM, Wilcots J (2017) The rise and fall of stromatolites in shallow marine environments. Geology 45(6):487–490. https://doi.org/10.1130/g38931.1
Qiu Q, Xie Z, Wu L, Tao L (2019a) GNER: a generative model for geological named entity recognition without labeled data using deep learning. Earth Space Sci 6(6):931–946. https://doi.org/10.1029/2019ea000610
Qiu Q, Xie Z, Wu L, Tao L, Li W (2019b) BiLSTM-CRF for geological named entity recognition from the geoscience literature. Earth Sci Inf 12(4):565–579. https://doi.org/10.1007/s12145-019-00390-3
Qiu Q, Ma K, Lv H, Tao L, Xie Z (2023a) Construction and application of a knowledge graph for iron deposits using text mining analytics and a deep learning algorithm. Math Geosci 55(3):423–456
Qiu Q, Wang B, Ma K, Lü H, Tao L, Xie Z (2023b) A practical Approach to constructing a geological knowledge graph: a case study of Mineral Exploration Data. J Earth Sci 34(5):1374–1389. https://doi.org/10.1007/s12583-023-1809-3
Reichstein M, Camps-Valls G, Stevens B, Jung M, Denzler J, Carvalhais N, Prabhat (2019) Deep learning and process understanding for data-driven Earth system science. Nature 566(7743):195–204. https://doi.org/10.1038/s41586-019-0912-1
Tang X, Feng Z, Xiao Y, Wang M, Ye T, Zhou Y et al (2023) Construction and application of an ontology-based domain-specific knowledge graph for petroleum exploration and development. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101426
Wang C, Ma X, Chen J (2018a) Ontology-driven data integration and visualization for exploring regional geologic time and paleontological information. Comput Geosci 115:12–19. https://doi.org/10.1016/j.cageo.2018.03.004
Wang C, Ma X, Chen J, Chen J (2018b) Information extraction and knowledge graph construction from geoscience literature. Comput Geosci 112:112–120
Wang C, Hazen RM, Cheng Q, Stephenson MH, Zhou C, Fox P et al (2021) The deep-time Digital Earth program: data-driven discovery in geosciences. Natl Sci Rev 8(9):nwab027
Wang C, Li Y, Chen J (2023a) Text mining and knowledge graph construction from geoscience literature legacy: A review. In X. Ma, M. Mookerjee, L. Hsu, & D. Hills (Eds.), Recent Advancement in Geoinformatics and Data Science (pp. 11–28). Geological Society of America. https://doi.org/10.1130/2022.2558(02)
Wang C, Li Y, Chen j, Ma X (2023b) Named entity annotation schema for geological literature mining in the domain of porphyry copper deposits. Ore Geol Rev 152:105243. https://doi.org/10.1016/j.oregeorev.2022.105243
Wang S, Zhu Y, Qi Y, Hou Z, Sun K, Li W et al (2023c) A unified framework of temporal information expression in geosciences knowledge system. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101465
Xu H, Zhao Y, Huang H, Dong S, Shi Y, Huang C et al (2023) A comprehensive construction of the domain ontology for stratigraphy. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101461
Yu C, Zhang L, Hou M, Yang J, Zhong H, Wang C (2023) Climate paleogeography knowledge graph and deep time paleoclimate classifications. Geosci Front 14(5). https://doi.org/10.1016/j.gsf.2022.101450
Zhang C (2015) DeepDive: a data management system for automatic knowledge base construction. The University of Wisconsin-Madison
Zhou X-G, Gong R-B, Shi F-G, Wang Z-F (2020) PetroKG: construction and application of knowledge graph in Upstream Area of PetroChina. J Comput Sci Technol 35(2):368–378. https://doi.org/10.1007/s11390-020-9966-7
Zhou C, Wang H, Wang C, Hou Z, Zheng Z, Shen S et al (2021) Geoscience knowledge graph in the big data era. Sci China Earth Sci 64(7):1105–1114. https://doi.org/10.1007/s11430-020-9750-4
Zhu Y, Zhou W, Xu Y, Liu J, Tan Y (2017) Intelligent Learning for Knowledge Graph towards Geological Data. Sci Program 2017:1–13. https://doi.org/10.1155/2017/5072427
Funding
This study was funded by the National Key R&D Program of China (2022YFF0801202, 2022YFF0801200), National Natural Science Foundation of China (41902305), Knowledge Innovation Program of Wuhan-Shuguang (2023010201020332).
Author information
Authors and Affiliations
Contributions
ChengbinWang: writing-original draft, conceptualization, data validation, writing-review and editing, project administration, funding acquisition. Liangquan Tan: building the mapping rules. Yuanjun Li: knowledge graph visualization and application. Mingguo Wang: writing-review and editing. Xiaogang Ma: writing-review and editing, data validation. Jianguo Chen: funding acquisition, data validation, writing-review, and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Communicated by H. Babaie.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wang, C., Tan, L., Li, Y. et al. Ontology-driven relational data mapping for constructing a knowledge graph of porphyry copper deposits. Earth Sci Inform 17, 2649–2660 (2024). https://doi.org/10.1007/s12145-024-01307-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-024-01307-5