research-article

HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units

Authors:

Satria Priambada,

Alexey TumanovAuthors Info & Claims

KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

Pages 1614 - 1624

https://doi.org/10.1145/3394486.3403212

Published: 20 August 2020 Publication History

Abstract

Deep learning models have achieved expert-level performance in healthcare with an exclusive focus on training accurate models. However, in many clinical environments such as intensive care unit (ICU), real-time model serving is equally if not more important than accuracy, because in ICU patient care is simultaneously more urgent and more expensive. Clinical decisions and their timeliness, therefore, directly affect both the patient outcome and the cost of care. To make timely decisions, we argue the underlying serving system must be latency-aware. To compound the challenge, health analytic applications often require a combination of models instead of a single model, to better specialize individual models for different targets, multi-modal data, different prediction windows, and potentially personalized predictions. To address these challenges, we propose HOLMES---an online model ensemble serving framework for healthcare applications. HOLMES dynamically identifies the best performing set of models to ensemble for highest accuracy, while also satisfying sub-second latency constraints on end-to-end prediction. We demonstrate that HOLMES is able to navigate the accuracy/latency tradeoff efficiently, compose the ensemble, and serve the model ensemble pipeline, scaling to simultaneously streaming data from 100 patients, each producing waveform data at 250~Hz. HOLMES outperforms the conventional offline batch-processed inference for the same clinical task in terms of accuracy and latency (by order of magnitude). HOLMES is tested on risk prediction task on pediatric cardio ICU data with above 95% prediction accuracy and sub-second latency on 64-bed simulation.

References

[1]

Sébastien Bailly, Geert Meyfroidt, and Jean-François Timsit. 2018. What's new in ICU in 2050: big data and machine learning. Intensive care medicine 44, 9 (2018),1524--1527.

[2]

James Bergstra and Yoshua Bengio. 2012. Random search for hyper-parameter optimization. Journal of machine learning research 13, Feb (2012), 281--305.

Digital Library

[3]

James Bergstra, Daniel Yamins, and David Daniel Cox. 2013. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. JMLR(2013).

[4]

James S Bergstra, Rémi Bardenet, Yoshua Bengio, and Balázs Kégl. 2011. Algorithms for hyper-parameter optimization. In Advances in neural information processing systems. 2546--2554.

[5]

Leo Breiman. 1996. Bagging Predictors. Mach. Learn.24, 2 (Aug. 1996), 123--140. https://doi.org/10.1023/A:1018054314350

Digital Library

[6]

Leo Breiman. 2001. Random forests. Machine learning 45, 1 (2001), 5--32.

Digital Library

[7]

Han Cai, Ligeng Zhu, and Song Han. 2018. Proxylessnas: Direct neural architecture search on target task and hardware.arXiv:1812.00332(2018).

[8]

Leo Anthony Celi, Roger G Mark, David J Stone, and Robert A Montgomery. 2013. "Big data" in the intensive care unit. Closing the data loop.American journal of respiratory and critical care medicine 187, 11 (2013), 1157.

[9]

Daniel Crankshaw, Xin Wang, Giulio Zhou, Michael J. Franklin, Joseph E. Gonzalez, and Ion Stoica. 2016. Clipper: A Low-Latency Online Prediction Serving System. CoRRabs/1612.03079 (2016). arXiv:1612.03079 http://arxiv.org/abs/1612.03079

[10]

Thomas Elsken, Jan Hendrik Metzen, and Frank Hutter. 2018. Neural architecture search: A survey. arXiv preprint arXiv:1808.05377(2018).

[11]

Hayley B Gershengorn, Allan Garland, and Michelle N Gong. 2015. Patterns of daily costs differ for medical and surgical intensive care unit patients.Annals of the American Thoracic Society 12, 12 (2015), 1831--1836.

[12]

Varun Gulshan, Lily Peng, Marc Coram, Martin C. Stumpe, Derek Wu, Arunachalam Narayanaswamy, Subhashini Venugopalan, Kasumi Widner, Tom Madams,Jorge Cuadros, Ramasamy Kim, Rajiv Raman, Philip C. Nelson, Jessica L. Mega,and Dale R. Webster. 2016. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA 316, 22 (12 2016), 2402--2410. https://doi.org/10.1001/jama.2016.17216arXiv:https://jamanetwork.com/journals/jama/articlepdf/2588763/joi160132.pdf

[13]

Neil A Halpern and Stephen M Pastores. 2015. Critical care medicine beds, use, occupancy and costs in the United States: a methodological review. Critical care medicine 43, 11 (2015), 2452.

[14]

Awni Y Hannun, Pranav Rajpurkar, Masoumeh Haghpanahi, Geoffrey H Tison, Codie Bourn, Mintu P Turakhia, and Andrew Y Ng. 2019. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms usinga deep neural network. Nature medicine 25, 1 (2019), 65.

[15]

Hrayr Harutyunyan, Hrant Khachatrian, David C Kale, Greg Ver Steeg, and Aram Galstyan. 2017. Multitask learning and benchmarking with clinical time series data. arXiv preprint arXiv:1703.07771(2017).

[16]

Shenda Hong, Yuxi Zhou, Junyuan Shang, Cao Xiao, and Jimeng Sun. 2020. Opportunities and challenges of deep learning methods for electrocardiogram data: A systematic review. Computers in Biology and Medicine(2020), 103801.

[17]

Frank Hutter, Holger H Hoos, and Kevin Leyton-Brown. 2011. Sequential model-based optimization for general algorithm configuration. In International conference on learning and intelligent optimization. Springer, 507--523.

Digital Library

[18]

Slawomir Koziel and Leifur Leifsson. 2013. Surrogate-based modeling and optimization. Springer.

[19]

Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. nature 521, 7553 (2015), 436.

[20]

Julien-Charles Lévesque, Christian Gagné, and Robert Sabourin. 2016. Bayesian hyperparameter optimization for ensemble learning. In Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence. 437--446.

Digital Library

[21]

Zachary C Lipton, David C Kale, Charles Elkan, and Randall Wetzel. 2016. Learning to diagnose with LSTM recurrent neural networks. ICLR.

[22]

Chenxi Liu, Barret Zoph, Maxim Neumann, Jonathon Shlens, Wei Hua, Li-Jia Li,Li Fei-Fei, Alan Yuille, Jonathan Huang, and Kevin Murphy. 2018. Progressive neural architecture search. In ECCV. 19--34.

[23]

Xiangrui Meng, Joseph Bradley, Burak Yavuz, Evan Sparks, Shivaram Venkataraman, Davies Liu, Jeremy Freeman, DB Tsai, Manish Amde, Sean Owen, Doris Xin, Reynold Xin, Michael J. Franklin, Reza Zadeh, Matei Zaharia, and Ameet Talwalkar. 2015. MLlib: Machine Learning in Apache Spark. arXiv:1505.06807 [cs.LG]

[24]

Jonas Mockus. 2012.Bayesian approach to global optimization: theory and applications. Vol. 37. Springer Science & Business Media.

[25]

Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, RichardLiaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I Jordan,et al. 2018. Ray: A distributed framework for emerging AI applications. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI'18). 561--577.

[26]

Phuoc Nguyen, Truyen Tran, and Svetha Venkatesh. 2017. Deep learning to attend to risk in ICU. arXiv preprint arXiv: 1707.05010(2017).

[27]

Christopher Olston, Noah Fiedel, Kiril Gorovoy, Jeremiah Harmsen, Li Lao, Fang-wei Li, Vinu Rajashekhar, Sukriti Ramesh, and Jordan Soyke. 2017. Tensorflow-serving: Flexible, high-performance ml serving. arXiv:1712.06139(2017).

[28]

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems. 8024--8035.

[29]

Alvin Rajkomar, Eyal Oren, Kai Chen, Andrew M Dai, Nissan Hajaj, Michaela Hardt, Peter J Liu, Xiaobing Liu, Jake Marcus, Mimi Sun, et al.2018. Scalable and accurate deep learning with electronic health records. NPJ Digital Medicine 1, 1(2018), 18.

[30]

Pranav Rajpurkar, Jeremy Irvin, Kaylie Zhu, Brandon Yang, Hershel Mehta, Tony Duan, Daisy Ding, Aarti Bagul, Curtis Langlotz, Katie Shpanskaya, et al. 2017. Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv preprint arXiv:1711.05225(2017).

[31]

Bobak Shahriari, Kevin Swersky, Ziyu Wang, Ryan P Adams, and Nando De Freitas. 2015. Taking the human out of the loop: A review of Bayesian optimization. Proc. IEEE 104, 1 (2015), 148--175.

[32]

Jasper Snoek, Hugo Larochelle, and Ryan P Adams. 2012. Practical bayesian optimization of machine learning algorithms. In Advances in neural information processing systems. 2951--2959.

[33]

Lu Wang, Wei Zhang, Xiaofeng He, and Hongyuan Zha. 2018. Supervised reinforcement learning with recurrent neural network for dynamic treatment recommendation. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 2447--2456.

Digital Library

[34]

Darrell Whitley. 1994. A genetic algorithm tutorial.Statistics and computing 4, 2(1994), 65--85.

[35]

Cao Xiao, Edward Choi, and Jimeng Sun. 2018. Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review. JAMIA25, 10 (2018), 1419--1428.

[36]

Saining Xie, Ross Girshick, Piotr Dollár, Zhuowen Tu, and Kaiming He. 2017. Aggregated residual transformations for deep neural networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1492--1500.

[37]

Yanbo Xu, Siddharth Biswal, Shriprasad R Deshpande, Kevin O Maher, and Jimeng Sun. 2018. Raim: Recurrent attentive and intensive model of multimodal patient monitoring data. In KDD. ACM, 2565--2573.

Digital Library

[38]

Zhi-Hua Zhou. 2012. Ensemble methods: foundations and algorithms. Chapman and Hall/CRC.

Digital Library

[39]

Barret Zoph and Quoc V Le. 2016. Neural architecture search with reinforcement learning. arXiv preprint arXiv:1611.01578(2016).

Cited By

Lei JZhou YTian XZhao QZhang QGeng SWu QHong S(2025)A deep learning method for beat-level risk analysis and interpretation of atrial fibrillation patients during sinus rhythmBiomedical Signal Processing and Control10.1016/j.bspc.2024.107028100(107028)Online publication date: Feb-2025
https://doi.org/10.1016/j.bspc.2024.107028
Liu XSha ZLiang J(2024)Multibranch Block-Based Grain Size Classification Of Hybrid Disk Using Ultrasonic Scattering: A Deep Learning MethodMaterials Evaluation10.32548/2024.me-0438882:4(38-51)Online publication date: 1-Apr-2024
https://doi.org/10.32548/2024.me-04388
Xue ZGeng SGuo SMu GYu BWang PHu SZhang DXu WLiu YYang LTao HHong SChen K(2024)Screening for severe coronary stenosis in patients with apparently normal electrocardiograms based on deep learningBMC Medical Informatics and Decision Making10.1186/s12911-024-02764-024:1Online publication date: 22-Nov-2024
https://doi.org/10.1186/s12911-024-02764-0
Show More Cited By

Recommendations

Measuring the awareness of health care providers at Benghazi Medical Center for Health Informatics
ICEMIS'20: Proceedings of the 6th International Conference on Engineering & MIS 2020

Health informatics has led to many changes in the health service delivery system. Awareness of health informatics is important and crucial to modern physicians. In general today there is a lack of awareness and understanding of health informatics among ...
On-demand Virtual Neonatal Intensive Care units supporting rural, remote and urban healthcare with Bush Babies Broadband

Premature and ill term babies born in regional Australia must be moved to another hospital with Neonatal Intensive Care Unit (NICU) facilities. Transferred pre-term and critically ill term babies have higher mortality rates and much higher rates of long ...
Data Mining Models to Predict Patientźs Readmission in Intensive Care Units
ICAART 2014: Proceedings of the 6th International Conference on Agents and Artificial Intelligence - Volume 1

Decision making is one of the most critical points in intensive care units and at this stage it is extremely difficult for health professionals to interpret almost instantly all the data available. In order to help the decision process, classification ...

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences

KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

August 2020

3664 pages

ISBN:9781450379984

DOI:10.1145/3394486

General Chairs:
Rajesh Gupta
UC San Diego, USA
,
Yan Liu
USC, USA
,
Program Chairs:
Mohak Shah
LG Electronics, USA
,
Suju Rajan
Linkedin, USA
,
Publications Chairs:
Jiliang Tang
Michigan State, USA
,
B. Aditya Prakash
Georgia Tech, USA

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 August 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Institute of Health award
National Science Foundation award

Conference

KDD '20

Sponsor:

KDD '20: The 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

July 6 - 10, 2020

CA, Virtual Event, USA

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

42
Total Citations
View Citations
880
Total Downloads

Downloads (Last 12 months)97
Downloads (Last 6 weeks)16

Reflects downloads up to 24 Nov 2024

Other Metrics

View Author Metrics

Citations

Cited By

Lei JZhou YTian XZhao QZhang QGeng SWu QHong S(2025)A deep learning method for beat-level risk analysis and interpretation of atrial fibrillation patients during sinus rhythmBiomedical Signal Processing and Control10.1016/j.bspc.2024.107028100(107028)Online publication date: Feb-2025
https://doi.org/10.1016/j.bspc.2024.107028
Liu XSha ZLiang J(2024)Multibranch Block-Based Grain Size Classification Of Hybrid Disk Using Ultrasonic Scattering: A Deep Learning MethodMaterials Evaluation10.32548/2024.me-0438882:4(38-51)Online publication date: 1-Apr-2024
https://doi.org/10.32548/2024.me-04388
Xue ZGeng SGuo SMu GYu BWang PHu SZhang DXu WLiu YYang LTao HHong SChen K(2024)Screening for severe coronary stenosis in patients with apparently normal electrocardiograms based on deep learningBMC Medical Informatics and Decision Making10.1186/s12911-024-02764-024:1Online publication date: 22-Nov-2024
https://doi.org/10.1186/s12911-024-02764-0
Shome DGhahjaverestan NEtemad A(2024)NapTune: Efficient Model Tuning for Mood Classification using Previous Night's Sleep Measures along with Wearable Time-seriesProceedings of the 26th International Conference on Multimodal Interaction10.1145/3678957.3685722(204-213)Online publication date: 4-Nov-2024
https://dl.acm.org/doi/10.1145/3678957.3685722
Sun CLi HSong MHong S(2024)A Ranking-Based Cross-Entropy Loss for Early Classification of Time SeriesIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.325020335:8(11194-11203)Online publication date: Aug-2024
https://doi.org/10.1109/TNNLS.2023.3250203
Li JLei ZZhang ZLi HTodo YGao S(2024)Alternating Excitation–Inhibition Dendritic Computing for ClassificationIEEE Transactions on Artificial Intelligence10.1109/TAI.2024.34162365:11(5431-5441)Online publication date: Nov-2024
https://doi.org/10.1109/TAI.2024.3416236
Yin WDong GAn DZhao YWang B(2024)Easy Pruning via Coresets and Structural Re-ParameterizationIEEE Signal Processing Letters10.1109/LSP.2024.341941931(1725-1729)Online publication date: 2024
https://doi.org/10.1109/LSP.2024.3419419
Lee JHan SLee J(2024)D2NAS: Efficient Neural Architecture Search With Performance Improvement and Model Size Reduction for Diverse TasksIEEE Access10.1109/ACCESS.2024.343474312(127074-127085)Online publication date: 2024
https://doi.org/10.1109/ACCESS.2024.3434743
Lin ZLiu XWang NLi RLiu QMa JWang LWang YHong S(2024)Deep learning with information fusion and model interpretation for long-term prenatal fetal heart rate datanpj Women's Health10.1038/s44294-024-00033-z2:1Online publication date: 3-Sep-2024
https://doi.org/10.1038/s44294-024-00033-z
Ding ZHuang YZeng XJiang SFeng SWang ZWang LWang ZXu YLiu Y(2024)Contrastive voxel clustering for multiscale modeling of brain networkNeuroImage10.1016/j.neuroimage.2024.120755297(120755)Online publication date: Aug-2024
https://doi.org/10.1016/j.neuroimage.2024.120755
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents