Abstract
Sepsis is a leading cause of death among intensive care unit patients. Early sepsis prediction, which primarily relies on advanced artificial intelligence technology, is an important phase in sepsis management. In previous sepsis prediction research, tremendous time and effort are spent in building high accurate tree ensemble models. However, due to the “black box” nature of tree ensemble models, clinicians typically reluctant to use them in clinical practice. To fill the gap, a novel multi-objective rule extraction method is proposed to trade off the predictive performance and the interpretability of tree ensemble model for sepsis prediction. The proposed method encompasses two phases: rule extraction utilizing a novel multi-objective binary differential evolution algorithm (MOBDE), and rule simplification employing a greedy heuristic method. In addition, rule extraction incorporates a Tchebycheff-based competitive learning strategy and a binary mutation operation aimed at enhancing the search ability of MOBDE. Rule simplification leveraging the proposed greedy heuristic method, streamlines the extracted rules from the previous phase by eliminating redundant constraints. The 2019 PhysioNet/CinC Challenge dataset, a well-known dataset in sepsis prediction, is used to evaluate the effectiveness of the proposed method. The experimental results show that the proposed method is more interpretable and has higher accuracy than some of commonly used rule-based models for sepsis prediction. The proposed method can be popularized to practical sepsis treatment, and assist doctors in sepsis diagnosis with the comprehensible decision-making support.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data Availability
The datasets used in this study are publicly accessible. The 2019 PhysioNet/CinC Challenge dataset is available at: https://physionet.org/content/challenge-2019/1.0.0/.
Availability of supporting data
The 2019 PhysioNet/CinC Challenge dataset is available at: https://physionet.org/content/challenge-2019/1.0.0/. The code for the proposed method has been uploaded to the Figshare database (https://doi.org/10.6084/m9.figshare.25378546.v1).
References
Bandyopadhyay, S., Pal, S. K., & Aruna, B. (2004). Multiobjective gas, quantitative indices, and pattern classification. IEEE Transactions on Systems Man and Cybernetics Part B-Cybernetics, 34(5), 2088–2099. https://doi.org/10.1109/tsmcb.2004.834438
Baniasadi, A., Rezaeirad, S., Zare, H., et al. (2021). Two-step imputation and AdaBoost-based classification for early prediction of sepsis on imbalanced clinical data. Critical Care Medicine, 49(1), E91–E97. https://doi.org/10.1097/ccm.0000000000004705
Barak, V., Schwartz, A., Kalickman, I., et al. (1998). Prevalence of hypophosphatemia in sepsis and infection: The role of cytokines. American Journal of Medicine, 104(1), 40–47. https://doi.org/10.1016/s0002-9343(97)00275-1
Barton, C., Chettipally, U., Zhou, Y., et al. (2019). Evaluation of a machine learning algorithm for up to 48-hour advance prediction of sepsis using six vital signs. Computers in Biology and Medicine, 109, 79–84. https://doi.org/10.1016/j.compbiomed.2019.04.027
Calvert, J. S., Price, D. A., Chettipally, U. K., et al. (2016). A computational approach to early sepsis detection. Computers in Biology and Medicine, 74, 69–73. https://doi.org/10.1016/j.compbiomed.2016.05.003
Chan, V., & Chan, C. (2017). Towards developing the piece-wise linear neural network algorithm for rule extraction. International Journal of Cognitive Informatics and Natural Intelligence, 11(2). https://doi.org/10.4018/ijcini.2017040104
Chen, M., Huo, J., & Duan, Y. (2023). A hybrid biogeography-based optimization algorithm for three-dimensional bin size designing and packing problem. Computers & Industrial Engineering, 180, 109,239. https://doi.org/10.1016/j.cie.2023.109239
Coussement, K., & Benoit, D. F. (2021). Interpretable data science for decision making. Decision Support Systems, 150. https://doi.org/10.1016/j.dss.2021.113664
Craven, M.W., & Shavlik, J.W. (1995). Extracting tree-structured representations of trained networks. In: Proceedings of the 8th international conference on neural information processing systems (pp. 24–30). MIT Press
Deb, K., Pratap, A., Agarwal, S., et al. (2002). A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Transactions on Evolutionary Computation, 6(2), 182–197. https://doi.org/10.1109/4235.996017
Desautels, T., Calvert, J., Hoffman, J., et al. (2016). Prediction of sepsis in the intensive care unit with minimal electronic health record data: A machine learning approach. JMIR Medical Informatics, 4(3), 67–81. https://doi.org/10.2196/medinform.5909
Duan, Y., Huo, J., Chen, M., et al. (2023). Early prediction of sepsis using double fusion of deep features and handcrafted features. Applied Intelligence, 53(14), 17,903-17,919. https://doi.org/10.1007/s10489-022-04425-z
Evans, L., Rhodes, A., Alhazzani, W., et al. (2021). Surviving sepsis campaign: International guidelines for management of sepsis and septic shock 2021. Critical Care Medicine, 49(11), e1063–e1143. https://doi.org/10.1097/ccm.0000000000005337
Fleischmann, C., Scherag, A., Adhikari, N. K. J., et al. (2016). Assessment of global incidence and mortality of hospital-treated sepsis. American Journal of Respiratory and Critical Care Medicine, 193(3), 259–272. https://doi.org/10.1164/rccm.201504-0781OC
Frieman, J. H., & Popescu, B. E. (2008). Predictive learning via rule ensembles. Annals of Applied Statistics, 2(3), 916–954. https://doi.org/10.1214/07-aoas148
He, Z., Du, L., Zhang, P., et al. (2020). Early sepsis prediction using ensemble learning with deep features and artificial features extracted from clinical electronic health records. Critical Care Medicine, 48(12), E1337–E1342. https://doi.org/10.1097/ccm.0000000000004644
Javan, S. L., Sepehri, M. M., Javan, M. L., et al. (2019). An intelligent warning model for early prediction of cardiac arrest in sepsis patients. Computer Methods and Programs in Biomedicine, 178, 47–58. https://doi.org/10.1016/j.cmpb.2019.06.010
Johansson, U., Sonstrod, C., Lofstrom, T., et al. (2022). Rule extraction with guarantees from regression models. Pattern Recognition, 126. https://doi.org/10.1016/j.patcog.2022.108554
Kam, H. J., & Kim, H. Y. (2017). Learning representations for the early detection of sepsis with deep neural networks. Computers in Biology and Medicine, 89, 248–255. https://doi.org/10.1016/j.compbiomed.2017.08.015
Kim, H. I., & Park, S. (2019). Sepsis: Early recognition and optimized treatment. Tuberculosis and respiratory diseases, 82(1), 6–14.
Kok, C., Jahmunah, V., Oh, S. L., et al. (2020). Automated prediction of sepsis using temporal convolutional network. Computers in Biology and Medicine, 127. https://doi.org/10.1016/j.compbiomed.2020.103957
Lauritsen, S. M., Kalor, M. E., Kongsgaard, E. L., et al. (2020). Early detection of sepsis utilizing deep learning on electronic health record event sequences. Artificial Intelligence in Medicine, 104. https://doi.org/10.1016/j.artmed.2020.101820
Lever, A., & Mackenzie, I. (2007). Sepsis: Definition, epiderhiology, and diagnosis. BMJ-British Medical Journal, 335(7625), 879–883. https://doi.org/10.1136/bmj.39346.495880.AE
Li, K., Luo, J., Hu, Y., et al. (2020). A novel multi-strategy de algorithm for parameter optimization in support vector machine. Journal of Intelligent Information Systems, 54(3), 527–543. https://doi.org/10.1007/s10844-019-00573-w
Li, X., Xu, X., Xie, F., et al. (2020). A time-phased machine learning model for real-time prediction of sepsis in critical care. Critical Care Medicine, 48(10), E884–E888. https://doi.org/10.1097/ccm.0000000000004494
Ma, X. L., Zhang, Q. F., Tian, G. D., et al. (2018). On Tchebycheff decomposition approaches for multiobjective evolutionary optimization. IEEE Transactions on Evolutionary Computation, 22(2), 226–244. https://doi.org/10.1109/tevc.2017.2704118
Mahmud, F., Pathan, N.S., Quamruzzaman, M., et al. (2020). Early detection of sepsis in critical patients using random forest classifier. In: IEEE-Region-10 symposium (TENSYMP) - technology for impactful sustainable development (pp. 130–133). IEEE Region 10 Symposium
Mao, Q., Jay, M., Hoffman, J.L., et al. (2018). Multicentre validation of a sepsis prediction algorithm using only vital sign data in the emergency department, general ward and ICU. BMJ Open, 8(1). https://doi.org/10.1136/bmjopen-2017-017833
Mashayekhi, M., & Gras, R. (2017). Rule extraction from decision trees ensembles: New algorithms based on heuristic search and sparse group lasso methods. International Journal of Information Technology & Decision Making, 16(6), 1707–1727. https://doi.org/10.1142/s0219622017500055
Mollura, M., Mantoan, G., Romano, S., et al. (2020). The role of waveform monitoring in sepsis identification within the first hour of intensive care unit stay. In: 2020 11th conference of the European study group on cardiovascular oscillations (ESGCO) (pp. 1–2). IEEE
Molnar C (2020) Interpretable machine learning
Obregon, J., Kim, A., & Jung, J. Y. (2019). RuleCOSI: Combination and simplification of production rules from boosted decision trees for imbalanced classification. Expert Systems with Applications, 126, 64–82. https://doi.org/10.1016/j.eswa.2019.02.012
Rabby, G., & Berka, P. (2023). Multi-class classification of COVID-19 documents using machine learning algorithms. Journal of Intelligent Information Systems, 60(2), 571–591. https://doi.org/10.1007/s10844-022-00768-8
Rafiei, A., Rezaee, A., Hajati, F., et al. (2021). SSP: Early prediction of sepsis using fully connected LSTM-CNN model. Computers in Biology and Medicine, 128. https://doi.org/10.1016/j.compbiomed.2020.104110
Rudin, C. (2019). Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence, 1(5), 206–215. https://doi.org/10.1038/s42256-019-0048-x
Scherpf, M., Graesser, F., Malberg, H., et al. (2019). Predicting sepsis with a recurrent neural network using the MIMIC III database. Computers in Biology and Medicine, 113. https://doi.org/10.1016/j.compbiomed.2019.103395
Seymour, C. W., Gesten, F., Prescott, H. C., et al. (2017). Time to treatment and mortality during mandated emergency care for sepsis. New England Journal of Medicine, 376(23), 2235–2244. https://doi.org/10.1056/NEJMoa1703058
Shashikumar, S. P., Li, Q., Clifford, G. D., et al. (2017). Multiscale network representation of physiological time series for early prediction of sepsis. Physiological Measurement, 38(12), 2235–2248. https://doi.org/10.1088/1361-6579/aa9772
Shashikumar, S. P., Josef, C. S., Sharma, A., et al. (2021). DeepAISE? an interpretable and recurrent neural survival model for early prediction of sepsis. Artificial Intelligence in Medicine, 113. https://doi.org/10.1016/j.artmed.2021.102036
Singer, M., Deutschman, C. S., Seymour, C. W., et al. (2016). The third international consensus definitions for sepsis and septic shock (Sepsis-3). Jama-Journal of the American Medical Association, 315(8), 801–810. https://doi.org/10.1001/jama.2016.0287
Tokinaga, S., Lu, J., & Ikeda, Y. (2005). Neural network rule extraction by using the genetic programming and its applications to explanatory classifications. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, 88–A, 2627–2635.
Wang, D., Li, J., Sun, Y., et al. (2021). A machine learning model for accurate prediction of sepsis in ICU patients. Frontiers in Public Health, 9. https://doi.org/10.3389/fpubh.2021.754348
Wang, S., Wang, Y., Wang, D., et al. (2020). An improved random forest-based rule extraction method for breast cancer diagnosis. Applied Soft Computing, 86. https://doi.org/10.1016/j.asoc.2019.105941
Yang, M. C., Liu, C. Y., Wang, X. Y., et al. (2020). An explainable artificial intelligence predictor for early detection of sepsis. Critical Care Medicine, 48(11), E1091–E1096. https://doi.org/10.1097/ccm.0000000000004550
Yedjour, D., & Benyettou, A. (2018). Symbolic interpretation of artificial neural networks based on multiobjective genetic algorithms and association rules mining. Applied Soft Computing, 72, 177–188. https://doi.org/10.1016/j.asoc.2018.08.007
Yedjour, D., & Benyettou, A. (2018). Symbolic interpretation of artificial neural networks based on multiobjective genetic algorithms and association rules mining. Applied Soft Computing, 72, 177–188. https://doi.org/10.1016/j.asoc.2018.08.007
Zhang, D., Yin, C., Hunold, K. M., et al. (2021). An interpretable deep-learning model for early prediction of sepsis in the emergency department. Patterns (New York, NY), 2(2), 100,196-100,196. https://doi.org/10.1016/j.patter.2020.100196
Zhang, Q., & Li, H. (2007). MOEA/D: A multiobjective evolutionary algorithm based on decomposition. IEEE Transactions on Evolutionary Computation, 11(6), 712–731. https://doi.org/10.1109/TEVC.2007.892759
Zhu, P., & Hu, Q. (2013). Rule extraction from support vector machines based on consistent region covering reduction. Knowledge-Based Systems, 42, 1–8. https://doi.org/10.1016/j.knosys.2012.12.003
Zitzler, E., & Thiele, L. (1999). Multiobjective evolutionary algorithms: A comparative case study and the strength Pareto approach. IEEE Transactions on Evolutionary Computation, 3(4), 257–271. https://doi.org/10.1109/4235.797969
Acknowledgements
Not Applicable.
Funding
The work has been supported by National Natural Science Foundation of China (No. 72171176, NO. 72021002) and China-Germany Cooperation project supported by National Natural Science Foundation of China (M-310).
Author information
Authors and Affiliations
Contributions
Yongrui Duan and Mingzhou Chen conceived and designed this study. Yongrui Duan and Mingzhou Chen performed the experiments. Yongrui Duan, Mingzhou Chen and Jiazheng Huo assisted with research and writing. All authors were responsible for data interpretation and statistical analysis. All authors assisted in drafting and revising the manuscript at all stages, and all have approved it in this final form.
Corresponding author
Ethics declarations
Ethical approval
The authors state that this research complies with ethical standards. This research does not involve either human participants or animals.
Competing interests
The authors declare no competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, M., Huo, J. & Duan, Y. An interpretable model for sepsis prediction using multi-objective rule extraction. J Intell Inf Syst 62, 1403–1429 (2024). https://doi.org/10.1007/s10844-024-00859-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10844-024-00859-8