Abstract
Data citation has become a prevalent practice within the scientific community, serving the purpose of facilitating data discovery, reproducibility, and credit attribution. Consequently, data has gained significant importance in the scholarly process. Despite its growing prominence, data citation is still at an early stage, with considerable variations in practices observed across scientific domains. Such diversity hampers the ability to consistently analyze, detect, and quantify data citations.
We focus on the European Marine Science (MES) community to examine how data is cited in this specific context. We identify four types of data citations: formal, informal, complete, and incomplete. By analyzing the usage of these diverse data citation modalities, we investigate their impact on the widespread adoption of data citation practices.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
References
Altman, M., Borgman, C., Crosas, M., Matone, M.: An introduction to the joint principles for data citation. Bull. Assoc. Inf. Sci. Technol. 41(3), 43–45 (2015)
Belter, C.W.: Measuring the value of research data: a citation analysis of oceanographic data sets. PLoS ONE 9(3), e92590 (2014)
Buneman, P., Dosso, D., Lissandrini, M., Silvello, G.: Data citation and the citation graph. Quant. Sci. Stud. 2(4), 1399–1422 (2021)
Callaghan, S.: Preserving the integrity of the scientific record: data citation and linking. Learn. Publish. 27(5), S15–S24 (2014)
Chao, T.C.: Disciplinary reach: investigating the impact of dataset reuse in the earth sciences. Proc. Am. Soc. Inf. Sci. Technol. 48(1), 1–8 (2011)
Costello, M.J., Michener, W.K., Gahegan, M., Zhang, Z.Q., Bourne, P.E.: Biodiversity data should be published, cited, and peer reviewed. Trends Ecol. Evol. 28(8), 454–461 (2013)
Cousijn, H., et al.: A data citation roadmap for scientific publishers. Sci. Data 5(1), 1–11 (2018)
Crosas, M.: The evolution of data citation: from principles to implementation. IASSIST Q. 37(1–4), 62–62 (2014)
Duval, E.: Metadata standards: what, who & why. J. Univers. Comput. Sci. 7(7), 591–601 (2001)
Fenner, M., et al.: A data citation roadmap for scholarly data repositories. Sci. Data 6(1), 28 (2019)
He, L., Nahar, V.: Reuse of scientific data in academic publications: an investigation of Dryad Digital Repository. Aslib J. Inf. Manag. (2016)
Huang, Y.H., Rose, P.W., Hsu, C.N.: Citing a data repository: a case study of the protein data bank. PLoS ONE 10(8), e0136631 (2015)
Irrera, O.: MES citations scholarly graph (2023). https://doi.org/10.5281/zenodo.8006578
Irrera, O., Mannocci, A., Manghi, P., Silvello, G.: A novel curated scholarly graph connecting textual and data publications. J. Data Inf. Qual. (2023). https://doi.org/10.1145/3597310
Lopez, P.: GROBID: combining automatic bibliographic data recognition and term extraction for scholarship publications. In: Agosti, M., Borbinha, J., Kapidakis, S., Papatheodorou, C., Tsakonas, G. (eds.) ECDL 2009. LNCS, vol. 5714, pp. 473–474. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04346-8_62
Mannocci, A., Irrera, O., Manghi, P.: Open science and authorship of supplementary material. Evidence from a research community. arXiv preprint arXiv:2207.02775 (2022)
Mooney, H.: Citing data sources in the social sciences: do authors do it? Learn. Publish. 24(2), 99–108 (2011)
Mooney, H., Newton, M.P.: The anatomy of a data citation: discovery, reuse, and credit. J. Librariansh. Scholar. Commun. 1(1) (2012)
Park, H., Wolfram, D.: An examination of research data sharing and re-use: implications for data citation practice. Scientometrics 111(1), 443–461 (2017). https://doi.org/10.1007/s11192-017-2240-2
Park, H., You, S., Wolfram, D.: Informal data citation for data sharing and reuse is more common than formal data citation in biomedical fields. J. Am. Soc. Inf. Sci. 69(11), 1346–1354 (2018)
Parsons, M.A., Duerr, R., Minster, J.B.: Data citation and peer review. EOS Trans. Am. Geophys. Union 91(34), 297–298 (2010)
Pepe, A., Goodman, A., Muench, A., Crosas, M., Erdmann, C.: How do astronomers share data? Reliability and persistence of datasets linked in AAS publications and a qualitative study of data practices among US astronomers. PLoS ONE 9(8), e104798 (2014)
Peroni, S., Shotton, D.: OpenCitations, an infrastructure organization for open scholarship. Quant. Sci. Stud. 1(1), 428–444 (2020)
Robinson-García, N., Jiménez-Contreras, E., Torres-Salinas, D.: Analyzing data citation practices using the data citation index. J. Am. Soc. Inf. Sci. 67(12), 2964–2975 (2016)
Silvello, G.: Theory and practice of data citation. J. Am. Soc. Inf. Sci. 69(1), 6–20 (2018)
Walton, D.W.: Data citation-Moving to new norms. Antarct. Sci. 22(4), 333–333 (2010)
Yoon, J., Chung, E., Lee, J.Y., Kim, J.: How research data is cited in scholarly literature: a case study of HINTS. Learn. Publish. 32(3), 199–206 (2019)
Zhao, M., Yan, E., Li, K.: Data set mentions and citations: a content analysis of full-text publications. J. Am. Soc. Inf. Sci. 69(1), 32–46 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Irrera, O., Mannocci, A., Manghi, P., Silvello, G. (2023). Tracing Data Footprints: Formal and Informal Data Citations in the Scientific Literature. In: Alonso, O., Cousijn, H., Silvello, G., Marrero, M., Teixeira Lopes, C., Marchesin, S. (eds) Linking Theory and Practice of Digital Libraries. TPDL 2023. Lecture Notes in Computer Science, vol 14241. Springer, Cham. https://doi.org/10.1007/978-3-031-43849-3_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-43849-3_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43848-6
Online ISBN: 978-3-031-43849-3
eBook Packages: Computer ScienceComputer Science (R0)