Abstract
The Open Archives Initiative (OAI) is an experimental initiative for the interoperability of Digital Libraries (DLs) based on metadata harvesting. The goal of OAI is to develop and promote interoperability solutions to facilitate the efficient dissemination of content. At present, however, there are still several challenging issues such as metadata incorrectness, poor quality of metadata, and metadata inconsistency that have to be solved in order to create a variety of high-quality services. In this paper we propose an integrated DL system with OAI and self-organizing capabilities. The system provides two value-added services, cross-archive searching and interactive concept browsing services, for organizing, exploring, and searching a collection of harvested metadata to satisfy users’ information needs. We also propose a multi-layered Self-Organizing Map (SOM) algorithm for building a subject-specific concept hierarchy using two input vector sets constructed by indexing the harvested metadata collection. By using the concept hierarchy, we can also automatically classify the harvested metadata collection for the purpose of selective harvesting.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Lagoze, C., Van de Sompel, H.: The Open Archives Initiative: Building a low-barrier interoperability framework. In: Proceedings of the First ACM/IEEE Joint Conference on Digital Libraries, Roanoke, VA, pp. 54–62 (2001)
Lagoze, C., Van de Sompel, H.: The Open Archives Initiative Protocol for Metadata Harvesting. Open Archives Initiative (2001)
Dublin Core Metadata Initiative.: Dublin Core Metadata Element Set, Version 1.1: Reference Description (1999), http://www.dublincore.org/documents/1999/07/02/dces/
Van de Sompel, H., Krichel, T., Nelson, M.L.: The UPS Prototype: an experimental end-user service across e-print archives. D-Lib Magazine, 6(2) (2000)
Liu, X., Maly, K., Zubair, M., Hong, Q., Nelson, M.L., Knudson, F., Holtkamp, I.: Federated Searching Interfaces Techniques for Heterogeneous OAI Repositories. Journal of Digital Information 2(4) (2002)
Chen, S.: Digital Libraries: The Life Cycle of Information. Better Earth Publisher (1998)
Chen, S., Choo, C.: A DL Server with OAI Capabilities: Managing the Metadata Complexity. In: Joint Conference on Digital Libraries (JCDL 2002), Portland, OR (2002)
Powell, A.L., French, J.C., Callan, J.: The Impact of Database Selection on Distributed Searching. In: Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 232–239. ACM, New York (2000)
Powell, A.L., French, J.C.: Growth and Server Availability of the NCSTL Digital Library. In: Proceedings of 5th ACM Conference on Digital Libraries, pp. 264–265 (2000)
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. ACM Press, New York (1999)
Vesanto, J., Alhoniemi, E.: Clustering of the Self-Organizing Map. IEEE Transactions on Neural Networks 11(3), 586–600 (2000)
Chakrabarti, S.: Data mining for hypertext: A tutorial survey. In: ACM SIGKDD Explorations, vol. 1(2), pp. 1–11 (2000)
Chen, H., Schuffels, C., Orwig, R.: Internet Categorization and Search: A Self- Organizing Approach. Journal of Visual Communication and Image Representation 7(1), 88–102 (1996)
Kohonen, T.: Self-Organization of Very Large Document Collection: State of the Art. In: Proceedings of ICANN 1998, the 8th International Conference on Artificial Neural Networks, Skovde, Sweden (1998)
Roussinov, D., Chen, H.: A Scalable Self-Organizing Map Algorithm for Textual Classification: A Neural Network Approach to Thesaurus Generation. Communication Cognition and Artificial Intelligence 15(1-2), 81–111 (1998)
Dittenbach, M., Merkl, D., Rauber, A.: The Growing Hierarchical Self-Organizing Map. In: Proceedings of the International Joint Conference on Neural Networks (IJCNN 2000), vol. 6, pp. 15–19 (2000)
Kohonen, T.: Self-Organizing Maps, 3rd edn. Springer, Berlin (2001)
Hillmann, D.: Using Dublin Core. Dublin Core Metadata Initiative Recommendation (2001), http://www.dublincore.org/documents/2001/04/12/usageguide/
Salton, G., Buckley, C.: Term-Weighting Approaches in Automatic Text Retrieval. Information Processing and Management 24(5), 513–523 (1988)
Kohonen, T., Kaski, S., Lagus, K., Salojärvi, J., Honkela, J., Paatero, V., Saarela, A.: Self Organizing of a Massive Document Collection. IEEE Transactions on Neural Networks 11(3) (May 2000)
Brill, E.: A Simple Rule-based Part of Speech Tagger. In: Proceedings of the 3rd Conference on Applied Natural Language Processing, Trento, Italy (1992)
Brill, E.: Some advances in transformation-based part of speech tagging. In: Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA (1994)
Liu, X., Maly, K., Zubair, M., Nelson, M.L.: Arc - An OAI Service Provider for Digital Library Federation. D-Lib Magazine 7(4) (April 2001)
Suleman, H., Fox, E.A.: Beyond Harvesting: Digital Library Components as OAI Extensions. Technical report, Virginia Tech Dept. of Computer Science (January 2002)
Liu, X., Brody, T., Harnad, S., Carr, L., Maly, K., Zubair, M., Nelson, M.L.: A Scalable Architecture for Harvest-Based Digital Libraries - The ODU/Southampton Experiments. D-Lib Magazine 8(11) (November 2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, H., Choo, CY., Chen, SS. (2003). An Integrated Digital Library Server with OAI and Self-Organizing Capabilities. In: Koch, T., Sølvberg, I.T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2003. Lecture Notes in Computer Science, vol 2769. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45175-4_16
Download citation
DOI: https://doi.org/10.1007/978-3-540-45175-4_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40726-3
Online ISBN: 978-3-540-45175-4
eBook Packages: Springer Book Archive