Abstract
Since the 1990s, advancements in big data and information technology have increasingly driven data-centric research in the field of Library and Information Science (LIS). To assess the influence of this data-driven research paradigm on the LIS discipline, this study conducts a fine-grained analysis to uncover the evolutionary trends of research methods within the domain. Using academic papers from LIS published between 1990 and 2022, four key categories of data-driven method entities are automatically extracted: algorithms and models, data resources, software and tools, and metrics. Based on these entities, the study examines the evolution of LIS research methods from three dimensions: the characteristics of research method entities over time, their evolution within different research topics, and the evolutionary features of research method entities across various research methods. The findings highlight data resources as a pivotal driver of methodological evolution in LIS, revealing a cyclical pattern of "emergence-stability/practical application" in the development of research methods within the field.
Similar content being viewed by others
References
Angelov, D. (2020). Top2Vec: Distributed Representations of Topics. arXiv preprint arXiv: 2008.09470. https://doi.org/10.48550/arXiv.2008.09470
Burrough-Boenisch, J. (1999). International reading strategies for IMRD articles. Written Communication, 16(3), 296–316. https://doi.org/10.1177/0741088399016003002
Chu, H. (2015). Research methods in library and information science: A content analysis. Library & Information Science Research, 37(1), 36–41. https://doi.org/10.1016/j.lisr.2014.09.003
Chu, H., & Ke, Q. (2017). Research methods: What’s in the name? Library & Information Science Research, 39(4), 284–294. https://doi.org/10.1016/j.lisr.2017.11.001
Cortes, C., & Vapnik, V. (1995). Support vector machine. Machine Learning, 20(3), 273–297.
de Waard, A. (2007). A pragmatic structure for research articles. Proceedings of the 2nd International Conference on Pragmatic Web. https://doi.org/10.1145/13242371324247
de Waard, A., & Tel, G. (2006). The ABCDE Format Enabling Semantic Conference Proceedings. Proceedings of the First Workshop on Semantic Wikis - From Wiki to Semantics (SemWiki200), 97–105.
Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv preprint arXiv:1810.04805. http://arxiv.org/abs/1810.04805
Duck, G., Kovacevic, A., Robertson, D. L., Stevens, R., & Nenadic, G. (2015). Ambiguity and variability of database and software names in bioinformatics. Journal of Biomedical Semantics, 6, 1–11. https://doi.org/10.1186/s13326-015-0026-0
Graves, A., & Schmidhuber, J. (2005). Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks, 18(5–6), 602–610. https://doi.org/10.1016/j.neunet.2005.06.042
Han, X. (2020). Evolution of research topics in LIS between 1996 and 2019: An analysis based on latent Dirichlet allocation topic model. Scientometrics, 125(3), 2561–2595. https://doi.org/10.1007/s11192-020-03721-0
F.A.P. Harmsze. (2000). A modular structure for scientific articles in an electronic environment. PhD thesis, University of Amsterdam. https://dare.uva.nl/search?identifier=77377886-f6d8-4a48-aaf5-042f400b0e28
Ibrahim, B. (2021). Statistical methods used in Arabic journals of library and information science. Scientometrics, 126(5), 4383–4416. https://doi.org/10.1007/s11192-021-03913-2
Jaccard, P. (2010). The distribution of the flora in the alpine zone. New Phytologist, 11(2), 37–50.
Järvelin, K., & Vakkari, P. (2021). LIS research across 50 years: Content analysis of journal articles. Journal of Documentation, 78(7), 65–88. https://doi.org/10.1108/JD-03-2021-0062
Kalim, W. B., & Mercer, R. E. (2022, October). Method Entity Extraction from Biomedical Texts. Proceedings of the 29th International Conference on Computational Linguistics (pp. 2357–2362). https://aclanthology.org/2022.coling-1.207
Lafferty, J. D., McCallum, A., & Pereira, F. C. N. (2001). Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data. Proceedings of the Eighteenth International Conference on Machine Learning, 282–289.
Lou, W., Su, Z., He, J., & Li, K. (2021). A temporally dynamic examination of research method usage in the Chinese library and information science community. Information Processing & Management, 58(5), 102686. https://doi.org/10.1016/j.ipm.2021.102686
Lu, W., Huang, Y., Bu, Y., & Cheng, Q. (2018). Functional structure identification of scientific documents in computer science. Scientometrics, 115(1), 463–486. https://doi.org/10.1007/s11192-018-2640-y
Ma, J., & Yuan, H. (2019). Bi-LSTM+CRF-based named entity recognition in scientific papers in the field of ecological restoration technology. Proceedings of the Association for Information Science and Technology, 56(1), 186–195. https://doi.org/10.1002/pra2.16
McCallum, A., Freitag, D., & Pereira, F. C. N. (2000). Maximum Entropy Markov Models for Information Extraction and Segmentation. Proceedings of the Seventeenth International Conference on Machine Learning, 591–598.
McCallum, A., & Li, W. (2003). Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons. Proceedings of the Seventh Conference on Natural Language Learning at HLT-NAACL 2003, 188–191. https://aclanthology.org/W03-0430
Nunkoo, R., Thelwall, M., Ladsawut, J., & Goolaup, S. (2020). Three decades of tourism scholarship: Gender, collaboration and research methods. Tourism Management, 78, 104056. https://doi.org/10.1016/j.tourman.2019.104056
Rabiner, L. R. (1989). A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE, 77(2), Article 2. Proceedings of the IEEE. https://doi.org/10.1109/5.18626
Raffel, C., Shazeer, N., Roberts, A., Lee, K., Narang, S., Matena, M., Zhou, Y., Li, W., & Liu, P. J. (2020). Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research, 1(140), 1–67.
Tang, R., Mehra, B., Du, J. T., & Zhao, Y. C. (2021a). Framing a discussion on paradigm shift(s) in the field of information. Journal of the Association for Information Science and Technology. https://doi.org/10.1002/asi.24404
Tang, R., Mehra, B., Du, J. T., & Zhao, Y. C. (2021b). Paradigm shift in the field of information special issue editorial. Journal of the Association for Information Science and Technology. https://doi.org/10.1002/asi.24566
Vakkari, P. (2024). What characterizes LIS as a fragmenting discipline? Journal of Documentation, 80(7), 60–77. https://doi.org/10.1108/JD-10-2023-0207
Vakkari, P., Chang, Y., & Järvelin, K. (2022). Disciplinary contributions to research topics and methodology in library and information science—leading to fragmentation? Journal of the Association for Information Science and Technology, 73(12), 1706–1722. https://doi.org/10.1002/asi.24690
Wang, Y., Zhang, C., & Li, K. (2022). A review on method entities in the academic literature: Extraction, evaluation, and application. Scientometrics, 127(5), 2479–2520.
Zhang, C., & Tian, L. (2023). Non-synchronism in global usage of research methods in library and information science from 1990 to 2019. Scientometrics, 128(7), 3981–4006. https://doi.org/10.1007/s11192-023-04740-3
Zhang, C., Tian, L., & Chu, H. (2023a). Usage frequency and application variety of research methods in library and information science: Continuous investigation from 1991 to 2021. Information Processing & Management, 60(6), 103507. https://doi.org/10.1016/j.ipm.2023.103507
Zhang, C., Wang, F., Huang, Y., & Chang, L. (2023b). Interdisciplinarity of information science: An evolutionary perspective of theory application. Journal of Documentation, 80(2), 392–426. https://doi.org/10.1108/JD-07-2023-0135
Zhang, C., Wei, S., Zhao, Y., & Tian, L. (2023c). Gender differences in research topic and method selection in library and information science: Perspectives from three top journals. Library & Information Science Research, 45(3), 101255. https://doi.org/10.1016/j.lisr.2023.101255
Zhang, H., Zhang, C., & Wang, Y. (2024). Revealing the technology development of natural language processing: A scientific entity-centric perspective. Information Processing & Management, 61(1), 103574. https://doi.org/10.1016/j.ipm.2023.103574
Acknowledgements
This work was supported by the National Natural Science Foundation of China (Grant No.72074113).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhang, C., Mao, Y. & Peng, S. Data-driven evolution of library and information science research methods (1990–2022): a perspective based on fine-grained method entities. Scientometrics 129, 7889–7912 (2024). https://doi.org/10.1007/s11192-024-05202-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11192-024-05202-0