A Software Processing Chain for Evaluating Thesaurus Quality

  • Conference paper
Semantic Keyword-Based Search on Structured Data Sources (IKC 2016)

Abstract

Thesauri are knowledge models commonly used for information classification and retrieval. Their structure is defined by standards that describe the main features that concepts and relations must have. However, following these standards requires deep knowledge of the field the thesaurus is going to cover and experience in thesaurus construction. To help in this task, this paper describes a software processing chain whose validation components evaluate the quality of the main thesaurus features.

Notes

  1. https://spring.io/

  2. http://compling.hss.ntu.edu.sg/omw/

Acknowledgements

This work has been partially supported by the Keystone COST Action IC1302 and by the University of Zaragoza (project UZ2016-TEC-05).

Corresponding author

Correspondence to Javier Lacasta.

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Lacasta, J., Falquet, G., Nogueras-Iso, J., Zarazaga-Soria, J. (2017). A Software Processing Chain for Evaluating Thesaurus Quality. In: Calì, A., Gorgan, D., Ugarte, M. (eds) Semantic Keyword-Based Search on Structured Data Sources. IKC 2016. Lecture Notes in Computer Science, vol 10151. Springer, Cham. https://doi.org/10.1007/978-3-319-53640-8_8

  • DOI: https://doi.org/10.1007/978-3-319-53640-8_8

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-53639-2

  • Online ISBN: 978-3-319-53640-8

  • eBook Packages: Computer Science, Computer Science (R0)
