Abstract
The rapid spread of COVID-19 has resulted in a large number of infections and significant economic impact on countries worldwide, and COVID-19 testing is one of the important methods to identify infected individuals. The previous studies have indicated that with improved COVID-19 testing capabilities, more confirmed cases can be detected. Therefore, how to accurately forecast the COVID-19 testing capabilities is a key issue in controlling the spread of the pandemic. In this study, based on a dataset of COVID-19 including data from 184 countries and 893 regions, we propose a novel machine learning framework named BO–SHAP–BLS, which combines Shapley Additive Explanations (SHAP), Bayesian Optimization (BO), and Broad Learning System (BLS), for forecasting COVID-19 testing capabilities. Firstly, SHAP is used to analyze and rank the importance of the original features. Then, BO is adopted to optimize both the hyperparameters of BLS and the number of features simultaneously. Finally, BLS is adopted to predict the number of COVID-19 tests in various countries. Experimental results show that BO–SHAP–BLS significantly outperforms the other machine learning models, indicating higher accuracy in predicting the COVID-19 testing capabilities.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
References
Zhong N, Zheng B, Li Y, Poon L, Xie Z, Chan K, Li P, Tan S, Chang Q, Xie J et al (2003) Epidemiology and cause of severe acute respiratory syndrome (SARS) in Guangdong, People’s Republic of China, in February, 2003. Lancet 362(9393):1353–1358
Hui DS, Azhar EI, Madani TA, Ntoumi F, Kock R, Dar O, Ippolito G, Mchugh TD, Memish ZA, Drosten C et al (2020) The continuing 2019-ncov epidemic threat of novel coronaviruses to global health-the latest 2019 novel coronavirus outbreak in Wuhan, China. Int J Infect Dis 91:264–266
Gallo Marin B, Aghagoli G, Lavine K, Yang L, Siff EJ, Chiang SS, Salazar-Mather TP, Dumenco L, Savaria MC, Aung SN, et al (2021) Predictors of covid-19 severity: a literature review. Rev Med Virol 31(1):1–10
Ou S, He X, Ji W, Chen W, Sui L, Gan Y, Lu Z, Lin Z, Deng S, Przesmitzki S et al (2020) Machine learning model to project the impact of covid-19 on us motor gasoline demand. Nat Energy 5(9):666–673
DeFilippis E, Impink SM, Singell M, Polzer JT, Sadun R (2020) Collaborating during coronavirus: the impact of covid-19 on the nature of work. Tech. rep, National Bureau of Economic Research
Badr HS, Du H, Marshall M, Dong E, Squire MM, Gardner LM (2020) Association between mobility patterns and covid-19 transmission in the USA: a mathematical modelling study. Lancet Infect Dis 20(11):1247–1254
Guan D, Wang D, Hallegatte S, Davis SJ, Huo J, Li S, Bai Y, Lei T, Xue Q, Coffman D et al (2020) Global supply-chain effects of covid-19 control measures. Nat Hum Behav 4(6):577–587
Wang Y, Yuan Y, Wang Q, Liu C, Zhi Q, Cao J (2020) Changes in air quality related to the control of coronavirus in China: implications for traffic and industrial emissions. Sci Total Environ 731:139133
Zaki N, Mohamed EA (2021) The estimations of the covid-19 incubation period: a scoping reviews of the literature. J Infect Public Health 14(5):638–646
He X, Luo L, Tang X, Wang Q (2023) Healthcare, vol. 11 (MDPI, 2023), p 393
Zeroual A, Harrou F, Dairi A, Sun Y (2020) Deep learning methods for forecasting covid-19 time-series data: a comparative study. Chaos Solitons Fractals 140:110121
Abbasimehr H, Paki R, Bahrini A (2022) A novel approach based on combining deep learning models with statistical methods for covid-19 time series forecasting. Neural Comput Appl, pp 1–15
Malki Z, Atlam ES, Ewis A, Dagnew G, Alzighaibi AR, ELmarhomy G, Elhosseini MA, Hassanien AE, Gad I (2021) Arima models for predicting the end of covid-19 pandemic and the risk of second rebound. Neural Comput Appl 33:2929–2948
Kumar N, Susan S (2020) 2020 11th international conference on computing, communication and networking technologies (ICCCNT) (IEEE, 2020), pp 1–7
Hassantabar S, Stefano N, Ghanakota V, Ferrari A, Nicola GN, Bruno R, Marino IR, Hamidouche K, Jha NK (2021) Coviddeep: SARS-cov-2/covid-19 test based on wearable medical sensors and efficient neural networks. IEEE Trans Consum Electron 67(4):244–256
Giordano G, Blanchini F, Bruno R, Colaneri P, Di Filippo A, Di Matteo A, Colaneri M (2020) Modelling the covid-19 epidemic and implementation of population-wide interventions in Italy. Nat Med 26(6):855–860
Abbasimehr H, Paki R (2021) Prediction of covid-19 confirmed cases combining deep learning methods and Bayesian optimization. Chaos Solitons Fractals 142:110511
Anastassopoulou C, Russo L, Tsakris A, Siettos C (2020) Data-based analysis, modelling and forecasting of the covid-19 outbreak. PLoS ONE 15(3):e0230405
Huang J, Zhang L, Liu X, Wei Y, Liu C, Lian X, Huang Z, Chou J, Liu X, Li X et al (2020) Global prediction system for covid-19 pandemic. Sci Bull 65(22):1884
Dong E, Du H, Gardner L (2020) An interactive web-based dashboard to track covid-19 in real time. Lancet Infect Dis 20(5):533–534
He F, Zhou J, Feng ZK, Liu G, Yang Y (2019) A hybrid short-term load forecasting model based on variational mode decomposition and long short-term memory networks considering relevant factors with bayesian optimization algorithm. Appl Energy 237:103–116
Shahriari B, Swersky K, Wang Z, Adams RP, De Freitas N (2015) Taking the human out of the loop: a review of Bayesian optimization. Proc IEEE 104(1):148–175
Snoek J, Larochelle H, Adams RP (2012) Practical bayesian optimization of machine learning algorithms. Adv Neural Inf Process Syst 25
Quakulinski L, Koumpis A, Beyan OD (2022) 2022 Fourth International Conference on Transdisciplinary AI (TransAI) (IEEE, 2022), pp 116–121
Lundberg SM, Lee SI (2017) A unified approach to interpreting model predictions. Adv Neural Inf Process Syst 30
Nowak AS, Radzik T (1994) The Shapley value for n-person games in generalized characteristic function form. Games Econ Behav 6(1):150–161
Chen CP, Liu Z (2017) Broad learning system: an effective and efficient incremental learning system without the need for deep architecture. IEEE Trans Neural Netw Learn Syst 29(1):10–24
Pao YH, Park GH, Sobajic DJ (1994) Learning and generalization characteristics of the random vector functional-link net. Neurocomputing 6(2):163–180
Igelnik B, Pao YH (1995) Stochastic choice of basis functions in adaptive function approximation and the functional-link net. IEEE Trans Neural Netw 6(6):1320–1329
Gong X, Zhang T, Chen CP, Liu Z (2021) Research review for broad learning system: algorithms, theory, and applications. IEEE Trans Cybern 52(9):8922–8950
John Hopkins University. Coronavirus map. https://coronavirus.jhu.edu/map.html
Zhang Z, Li X, Lyu K, Zhao X, Zhang F, Liu D, Zhao Y, Gao F, Hu J, Xu D (2023) Exploring the transmission path, influencing factors and risk of aerosol transmission of SARS-cov-2 at xi’an Xianyang international airport. Int J Environ Res Public Health 20(1):865
Zhan C, Jiang W, Min H, Gao Y, Tse C (2023) Human migration-based graph convolutional network for pm2. 5 forecasting in post-covid-19 pandemic age. Neural Comput Appl 35(9):6457–6470
Dutta A (2022) Covid-19 waves: variant dynamics and control. Sci Rep 12(1):9332
Gnana DAA, Balamurugan SAA, Leavline EJ (2016) Literature review on feature selection methods for high-dimensional data. Int J Comput Appl 136(1):9–17
Acknowledgments
This work was supported by, the Natural Science Foundation of Guangdong Province, China (2023A1515011618), and Research on Learning Emotional Intelligence Analysis and Evaluation Based on Multimodal Data (21YJAZH072) 2021 Crossdisciplinary Research Project in Humanities and Social Sciences, Ministry of Education of China.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
All authors declare that they have no conflict of interest to this work.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Zhan, C., Miao, L., Lin, J. et al. BO–SHAP–BLS: a novel machine learning framework for accurate forecasting of COVID-19 testing capabilities. Neural Comput & Applic 36, 7119–7131 (2024). https://doi.org/10.1007/s00521-024-09449-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-024-09449-9