Abstract
In maintaining Digital Libraries, having bibliographic data up-to-date is critical, yet often minor irregularities may cause information isolation. Unlike documents for which various kinds of unique ID systems exist (e.g., DOI, ISBN), other bibliographic entities such as author and publication venue do not have unique IDs. Therefore, in current Digital Libraries, tracking such bibliographic entities is not trivial. For instance, suppose a scholar changes her last name from A to B. Then, a user, searching for her publications under the new name B, cannot get old publications that appeared under A although they are by the same person. For such a scenario, since both A and B are the same person, it would be desirable for Digital Libraries to track their identities accordingly. In this paper, we investigate this problem known as name authority control, and present our system-oriented solution. We first identify three core building blocks that underlie the phenomenon, and show taxonomy where different combinations of the building blocks can occur. Then, we consider how systems can support the problem in two common functions of Digital Libraries – Update and Search. Finally, our test-bed called OpenDBLP is presented where the suggested solution is fully implemented as a proof of the concept.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Hong, Y., Lee, D.: OpenDBLP: Rejuvenating the DBLP into Web Service Based Programmable Digital Library. Technical report, Penn State University (2004)
Hernandez, M.A., Stolfo, S.J.: The Merge/Purge Problem for Large Databases. In: ACM SIGMOD (1995)
Ley, M.: The DBLP Computer Science Bibliography: Evolution, Research Issues, Perspectives. In: SPIRE, Lisbon, Portugal (September 2002)
Warnner, J.W., Brown, E.W.: Automated Name Authority Control. In: ACM/IEEE JCDL (2001)
Davis, P.T., Elson, D.K., Klavans, J.L.: Methods for Precise Named Entity Matching in Digital Collections. In: ACM/IEEE JCDL (2003)
Synman, M.M.M., van Rensburg, M.J.: Revolutionizing Name Authority Control. In: ACM DL (2000)
Cruz, J.M.B., Klink, N.J.R., Krichel, T.: Personal Data in a Large Digital Library. In: Borbinha, J.L., Baker, T. (eds.) ECDL 2000. LNCS, vol. 1923, p. 127. Springer, Heidelberg (2000)
Han, H., Giles, C.L., Zha, H., et al.: Two Supervised Learning Approaches for Name Disambiguation in Author Citations. In: ACM/IEEE JCDL (2004)
CiteSeer: Scientific Literature Digital Library, http://citeseer.ist.psu.edu/
arXiv.org e-Print archive, http://arxiv.org/
Atkins, H., Lyons, C., Ratner, H., Risher, C., Shillum, C., Sidman, D., Stevens, A., Arms, W.: Reference Linking with DOIs: A Case Study. D-Lib Magazine (2000)
The Open Citation Project, http://opcit.eprints.org/
Fellegi, P., Sunter, A.B.: A Theory for Record Linkage. J. of the American Statistical Society 64, 1183–1210 (1969)
Pasula, H., Marthi, B., Milch, B., Russell, S., Shpitser, I.: Identity Uncertainty and Citation Matching. In: Advances in Neural Info. Processing Sys. MIT Press, Cambridge (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hong, Y., On, BW., Lee, D. (2004). System Support for Name Authority Control Problem in Digital Libraries: OpenDBLP Approach. In: Heery, R., Lyon, L. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2004. Lecture Notes in Computer Science, vol 3232. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30230-8_13
Download citation
DOI: https://doi.org/10.1007/978-3-540-30230-8_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23013-7
Online ISBN: 978-3-540-30230-8
eBook Packages: Springer Book Archive