Abstract
Digital repositories have been used by Universities and Libraries to store their bibliographic, scientific, and/or institutional contents, and then make their corresponding metadata publicly available to the web and through the OAI-PMH protocol. However, such metadata is not descriptive enough for a document to be easily discoverable. Even though the emergence of Semantic Web technologies have produced the interest of Digital Repository providers to publish and enrich their content using Linked Data (LD) technologies, those institutions have used different generation approaches, and in certain cases ad-hoc solutions to solve particular use cases, but none of them has performed a comparison between existing approaches in order to demonstrate which one is the best solution prior to its application. In order to address this question, we have performed a benchmark study that compares two commonly used generation approaches, and also describes our experience, lessons learned and challenges found during the process of publishing a DSpace digital repository as LD. Results show that the straightforward method for extracting data from a digital repository is through the standard OAI-PMH protocol, whose performance in terms of execution time is much shorter than the database approach, while additional data cleaning tasks are minimal.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
- 15.
- 16.
- 17.
- 18.
- 19.
- 20.
- 21.
- 22.
- 23.
- 24.
- 25.
- 26.
Extract, Transform, Load (ETL) process in data warehousing.
References
Villazón-Terrazas, B., Vilches-Blázquez, L.M., Corcho, O., Gómez-Pérez, A.: Methodological guidelines for publishing government linked data. In: Wood, D. (ed.) Linking Government Data, pp. 27–49. Springer, Heidelberg (2011). https://doi.org/10.1007/978-1-4614-1767-5_2
Alexopoulos, A.D., Koutsomitropoulos, D., Papatheodorou, T.S., Solomou, G.D.: Digital repositories and the semantic web: semantic search and navigation for DSpace. Georgia Institute of Technology (2009)
Koutsomitropoulos, D., Solomou, G.D., Papatheodorou, T.S.: Semantic interoperability of dublin core metadata in digital repositories. In: 2008 International Conference on Innovations in Information Technology (2008)
Koutsomitropoulos, D.A., Solomou, G.D., Domenech, R.: Dspace semantic search v2. 0: what’s new and current status. In: Proceeding of the 7th International Conference on Open Repositories (OR 2012), 9–13 July, Edinburgh (2012)
Koutsomitropoulos, D.A., Solomou, G.D., Papatheodorou, T.S.: Semantic query answering in digital repositories: semantic search v2 for DSpace. Int. J. Metadata Semant. Ontol. 8(1), 46–55 (2013)
Haslhofer, B., Schandl, B.: The OAI2LOD server: exposing OAI-PMH metadata as linked data. In: International Workshop on Linked Data on the Web (LDOW2008), Co-located with WWW 2008, Beijing, April 2008
Piedra, N., Chicaiza, J., Lopez-Vargas, J., Caro, E.T.: Guidelines to producing structured interoperable data from open access repositories. In: 2016 IEEE Frontiers in Education Conference (FIE), pp. 1–9. IEEE (2016)
Segarra, J., Ortiz, J., Espinoza, M., Saquicela, V.: Integration of digital repositories through federated queries using semantic technologies. In: 2016 XLII Latin American Computing Conference (CLEI), pp. 1–9. IEEE (2016)
Vila-Suero, D., Villazón-Terrazas, B., Gómez-Pérez, A.: Datos. bne. es: a library linked dataset. Semant. Web 4(3), 307–313 (2013)
Lampert, C.K., Southwick, S.B.: Leading to linking: introducing linked data to academic library digital collections. J. Libr. Metadata 13(2–3), 230–253 (2013)
Southwick, S.B.: A guide for transforming digital collections metadata into linked data using open source technologies. J. Libr. Metadata 15(1), 1–35 (2015)
Berners-Lee, T.: Linked Data - Design Issues, July 2006. http://www.w3.org/DesignIssues/LinkedData.html. Accessed 12 Jan 2017
Latif, A., Scherp, A., Tochtermann, K.: LOD for library science: benefits of applying linked open data in the digital library setting. KI-Künstliche Intelligenz 30(2), 149–157 (2016)
Das, S., Sundara, S., Cyganiak, R.: R2RML: RDB to RDF mapping language, September 2012. http://www.w3.org/TR/r2rml/. Accessed 13 June 2017
Hidalgo-Delgado, Y., Estrada-Nelson, R., Xu, B., Villazon-Terrazas, B., Leiva-Maderos, A., Tello, A.: Methodological guidelines for publishing library data as linked data. In: 2017 IEEE International Conference on Big Data (2017)
Anibaldi, S., Jaques, Y., Celli, F., Stellato, A., Keizer, J.: Migrating bibliographic datasets to the semantic web: the AGRIS case. Semant. Web 6(2), 113–120 (2015)
Konstantinou, N., Spanos, D.E., Houssos, N., Mitrou, N.: Exposing scholarly information as linked open data: Rdfizing DSpace contents. Electron. Libr. 32(6), 834–851 (2014)
Konstantinou, N., Spanos, D.-E.: Creating linked data from relational databases. In: Konstantinou, N., Spanos, D.-E. (eds.) Materializing the Web of Linked Data, pp. 73–102. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16074-0_4
Koutsomitropoulos, D.A., Solomou, G.D., Kalou, A.K.: Herding linked data: semantic search and navigation among scholarly datasets. Int. J. Semant. Comput. 9(04), 459–482 (2015)
Piedra, N., Chicaiza, J., Quichimbo, P.: Integración semántica de recursos educativos abiertos cosechados con oai-pmh. proceso aplicado al servicio de búsqueda de oers en la red esvial. de Formación virtual inclusiva y de calidad para el siglo XXI CAFVIR, pp. 337–351 (2015)
Piedra, N., et al.: Marco de trabajo para la integración de recursos digitales basado en un enfoque de web semántica. RISTI-Revista Ibérica de Sistemas e Tecnologias de Informação, pp. 55–70 (2015)
Hyland, B., Stones, R., Atemezing, I.G., EURECOM, Villazón-Terrazas, B., iSOCO, S.A., I.S.C.: Best practices for publishing linked data, January 2014. http://www.w3.org/TR/ld-bp/. Accessed 12 June 2017
Sauermann, L., Cyganiak, R., Ayers, D., Völkel, M.: Cool URIs for the semantic web, December 2008. http://www.w3.org/TR/cooluris/. Accessed 12 June 2017
Rodríguez Doncel, V., Gómez-Pérez, A., Villata, S.: A dataset of RDF licenses. In: Legal Knowledge and Information Systems: JURIX 2014: The Twenty-Seventh Annual Conference (2014)
Saquicela, V., et al.: LOD-GF: an integral linked open data generation framework. In: Botto-Tobar, M., Barba-Maggi, L., González-Huerta, J., Villacrés-Cevallos, P., S. Gómez, O., Uvidia-Fassler, M.I. (eds.) TICEC 2018. AISC, vol. 884, pp. 283–300. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-02828-2_21
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Gonzalez-Toral, S., Espinoza-Mejia, M., Saquicela, V. (2019). Digital Repositories and Linked Data: Lessons Learned and Challenges. In: Villazón-Terrazas, B., Hidalgo-Delgado, Y. (eds) Knowledge Graphs and Semantic Web. KGSWC 2019. Communications in Computer and Information Science, vol 1029. Springer, Cham. https://doi.org/10.1007/978-3-030-21395-4_4
Download citation
DOI: https://doi.org/10.1007/978-3-030-21395-4_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-21394-7
Online ISBN: 978-3-030-21395-4
eBook Packages: Computer ScienceComputer Science (R0)