Abstract
Thesauri are knowledge models commonly used for information classification and retrieval whose structure is defined by standards that describe the main features the concepts and relations must have. However, following these standards requires a deep knowledge of the field the thesaurus is going to cover and experience in their creation. To help in this task, this paper describes a software processing chain that provides different validation components that evaluates the quality of the main thesaurus features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Aitchison, J., Bawden, D., Gilchrist, A.: Thesaurus Construction and Use: A Practical Manual, Routledge (2000)
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: an architecture for development of robust HLT applications. In: Proceedings of the 40th annual meeting on association for computational linguistics, pp. 168–175. Association for Computational Linguistics (2002)
Eckert, K.: Usage-driven maintenance of knowledge organization systems. Ph.D. thesis, Universitat Mannheim (2012)
Fischer, D.H.: From thesauri towards ontologies? In: Structures and relations in knowledge organization - 5th International ISKO Conference, pp. 18–30, Lille, France, August 1998
Frakes, W.B., Baeza-Yates, R.: Thesaurus construction. In: Frakes, W.B., Baeza-Yates, R. (eds.) Information Retrieval: Data Structures & Algorithms, pp. 161–218. Addison Wesley, Reading (1992)
Gangemi, A., Guarino, N., Masolo, C., Oltramari, A.: Sweetening WORDNET with DOLCE. AI Mag. 24(3), 13–24 (2003)
International Organization for Standardization: Thesauri and interoperability with other vocabularies. ISO 25694, International Organization for Standardization (ISO) (2011)
International Organization for Standarization: Quality management and quality assurance. ISO 8402, International Organization for Standarization (1994)
Kless, D., Milton, S.: Towards quality measures for evaluating thesauri. In: Sánchez-Alonso, S., Athanasiadis, I.N. (eds.) MTSR 2010. CCIS, vol. 108, pp. 312–319. Springer, Heidelberg (2010). doi:10.1007/978-3-642-16552-8_28
Lacasta, J., Falquet, G., Zarazaga-Soria, F.J., Nogueras-Iso, J.: An automatic method for reporting the quality of thesauri. Data Knowl. Eng. 104, 1–14 (2016)
Mader, C., Haslhofer, B.: Perception and relevance of quality issues in web vocabularies. In: I-SEMANTICS 2013 Proceedings of the 9th International Conference on Semantic Systems (2013)
Miles, A., Bechhofer, S.: SKOS Simple Knowledge Organization System Reference. No. January in W3C Candidate Recommendation, W3C (2009)
Pinto, M.: A user view of the factors affecting quality of thesauri in social science databases. Libr. Inf. Sci. Res. 30(3), 216–221 (2008)
Poveda-Villalón, M., Suárez-Figueroa, M.C., Gómez-Pérez, A.: Validating ontologies with OOPS!. In: Teije, A., Völker, J., Handschuh, S., Stuckenschmidt, H., d’Acquin, M., Nikolov, A., Aussenac-Gilles, N., Hernandez, N. (eds.) EKAW 2012. LNCS (LNAI), vol. 7603, pp. 267–281. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33876-2_24
Savoy, J.: Report on CLEF-2001 experiments. Technical report, Institut interfacultaire d’informatique, Université de Neuchtel, Switzerland (2001)
Soergel, D.: Indexing Languages and Thesauri: Construction and Maintenance. Melville Pub, Company (1974)
Suominen, O., Mader, C.: Assessing and improving the quality of SKOS vocabularies. J. Data Semant. 3(1), 47–73 (2014)
Tarjan, R.E.: Depth-first search and linear graph algorithms. SIAM J. Comput. 1(2), 146–160 (1972)
Wielemaker, J., Hildebrand, M., Ossenbruggen, J., Schreiber, G.: Thesaurus-based search in large heterogeneous collections. In: Sheth, A., Staab, S., Dean, M., Paolucci, M., Maynard, D., Finin, T., Thirunarayan, K. (eds.) ISWC 2008. LNCS, vol. 5318, pp. 695–708. Springer, Heidelberg (2008). doi:10.1007/978-3-540-88564-1_44
Acknowledgements
This work has been partially supported by the Keystone COST Action IC1302 and by the University of Zaragoza (project UZ2016-TEC-05).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Lacasta, J., Falquet, G., Nogueras-Iso, J., Zarazaga-Soria, J. (2017). A Software Processing Chain for Evaluating Thesaurus Quality. In: Calì, A., Gorgan, D., Ugarte, M. (eds) Semantic Keyword-Based Search on Structured Data Sources. IKC 2016. Lecture Notes in Computer Science(), vol 10151. Springer, Cham. https://doi.org/10.1007/978-3-319-53640-8_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-53640-8_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-53639-2
Online ISBN: 978-3-319-53640-8
eBook Packages: Computer ScienceComputer Science (R0)