Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3394486.3403212acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units

Published: 20 August 2020 Publication History

Abstract

Deep learning models have achieved expert-level performance in healthcare with an exclusive focus on training accurate models. However, in many clinical environments such as intensive care unit (ICU), real-time model serving is equally if not more important than accuracy, because in ICU patient care is simultaneously more urgent and more expensive. Clinical decisions and their timeliness, therefore, directly affect both the patient outcome and the cost of care. To make timely decisions, we argue the underlying serving system must be latency-aware. To compound the challenge, health analytic applications often require a combination of models instead of a single model, to better specialize individual models for different targets, multi-modal data, different prediction windows, and potentially personalized predictions. To address these challenges, we propose HOLMES---an online model ensemble serving framework for healthcare applications. HOLMES dynamically identifies the best performing set of models to ensemble for highest accuracy, while also satisfying sub-second latency constraints on end-to-end prediction. We demonstrate that HOLMES is able to navigate the accuracy/latency tradeoff efficiently, compose the ensemble, and serve the model ensemble pipeline, scaling to simultaneously streaming data from 100 patients, each producing waveform data at 250~Hz. HOLMES outperforms the conventional offline batch-processed inference for the same clinical task in terms of accuracy and latency (by order of magnitude). HOLMES is tested on risk prediction task on pediatric cardio ICU data with above 95% prediction accuracy and sub-second latency on 64-bed simulation.

References

[1]
Sébastien Bailly, Geert Meyfroidt, and Jean-François Timsit. 2018. What's new in ICU in 2050: big data and machine learning. Intensive care medicine 44, 9 (2018),1524--1527.
[2]
James Bergstra and Yoshua Bengio. 2012. Random search for hyper-parameter optimization. Journal of machine learning research 13, Feb (2012), 281--305.
[3]
James Bergstra, Daniel Yamins, and David Daniel Cox. 2013. Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures. JMLR(2013).
[4]
James S Bergstra, Rémi Bardenet, Yoshua Bengio, and Balázs Kégl. 2011. Algorithms for hyper-parameter optimization. In Advances in neural information processing systems. 2546--2554.
[5]
Leo Breiman. 1996. Bagging Predictors. Mach. Learn.24, 2 (Aug. 1996), 123--140. https://doi.org/10.1023/A:1018054314350
[6]
Leo Breiman. 2001. Random forests. Machine learning 45, 1 (2001), 5--32.
[7]
Han Cai, Ligeng Zhu, and Song Han. 2018. Proxylessnas: Direct neural architecture search on target task and hardware.arXiv:1812.00332(2018).
[8]
Leo Anthony Celi, Roger G Mark, David J Stone, and Robert A Montgomery. 2013. "Big data" in the intensive care unit. Closing the data loop.American journal of respiratory and critical care medicine 187, 11 (2013), 1157.
[9]
Daniel Crankshaw, Xin Wang, Giulio Zhou, Michael J. Franklin, Joseph E. Gonzalez, and Ion Stoica. 2016. Clipper: A Low-Latency Online Prediction Serving System. CoRRabs/1612.03079 (2016). arXiv:1612.03079 http://arxiv.org/abs/1612.03079
[10]
Thomas Elsken, Jan Hendrik Metzen, and Frank Hutter. 2018. Neural architecture search: A survey. arXiv preprint arXiv:1808.05377(2018).
[11]
Hayley B Gershengorn, Allan Garland, and Michelle N Gong. 2015. Patterns of daily costs differ for medical and surgical intensive care unit patients.Annals of the American Thoracic Society 12, 12 (2015), 1831--1836.
[12]
Varun Gulshan, Lily Peng, Marc Coram, Martin C. Stumpe, Derek Wu, Arunachalam Narayanaswamy, Subhashini Venugopalan, Kasumi Widner, Tom Madams,Jorge Cuadros, Ramasamy Kim, Rajiv Raman, Philip C. Nelson, Jessica L. Mega,and Dale R. Webster. 2016. Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA 316, 22 (12 2016), 2402--2410. https://doi.org/10.1001/jama.2016.17216arXiv:https://jamanetwork.com/journals/jama/articlepdf/2588763/joi160132.pdf
[13]
Neil A Halpern and Stephen M Pastores. 2015. Critical care medicine beds, use, occupancy and costs in the United States: a methodological review. Critical care medicine 43, 11 (2015), 2452.
[14]
Awni Y Hannun, Pranav Rajpurkar, Masoumeh Haghpanahi, Geoffrey H Tison, Codie Bourn, Mintu P Turakhia, and Andrew Y Ng. 2019. Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms usinga deep neural network. Nature medicine 25, 1 (2019), 65.
[15]
Hrayr Harutyunyan, Hrant Khachatrian, David C Kale, Greg Ver Steeg, and Aram Galstyan. 2017. Multitask learning and benchmarking with clinical time series data. arXiv preprint arXiv:1703.07771(2017).
[16]
Shenda Hong, Yuxi Zhou, Junyuan Shang, Cao Xiao, and Jimeng Sun. 2020. Opportunities and challenges of deep learning methods for electrocardiogram data: A systematic review. Computers in Biology and Medicine(2020), 103801.
[17]
Frank Hutter, Holger H Hoos, and Kevin Leyton-Brown. 2011. Sequential model-based optimization for general algorithm configuration. In International conference on learning and intelligent optimization. Springer, 507--523.
[18]
Slawomir Koziel and Leifur Leifsson. 2013. Surrogate-based modeling and optimization. Springer.
[19]
Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. 2015. Deep learning. nature 521, 7553 (2015), 436.
[20]
Julien-Charles Lévesque, Christian Gagné, and Robert Sabourin. 2016. Bayesian hyperparameter optimization for ensemble learning. In Proceedings of the Thirty-Second Conference on Uncertainty in Artificial Intelligence. 437--446.
[21]
Zachary C Lipton, David C Kale, Charles Elkan, and Randall Wetzel. 2016. Learning to diagnose with LSTM recurrent neural networks. ICLR.
[22]
Chenxi Liu, Barret Zoph, Maxim Neumann, Jonathon Shlens, Wei Hua, Li-Jia Li,Li Fei-Fei, Alan Yuille, Jonathan Huang, and Kevin Murphy. 2018. Progressive neural architecture search. In ECCV. 19--34.
[23]
Xiangrui Meng, Joseph Bradley, Burak Yavuz, Evan Sparks, Shivaram Venkataraman, Davies Liu, Jeremy Freeman, DB Tsai, Manish Amde, Sean Owen, Doris Xin, Reynold Xin, Michael J. Franklin, Reza Zadeh, Matei Zaharia, and Ameet Talwalkar. 2015. MLlib: Machine Learning in Apache Spark. arXiv:1505.06807 [cs.LG]
[24]
Jonas Mockus. 2012.Bayesian approach to global optimization: theory and applications. Vol. 37. Springer Science & Business Media.
[25]
Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, RichardLiaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I Jordan,et al. 2018. Ray: A distributed framework for emerging AI applications. In 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI'18). 561--577.
[26]
Phuoc Nguyen, Truyen Tran, and Svetha Venkatesh. 2017. Deep learning to attend to risk in ICU. arXiv preprint arXiv: 1707.05010(2017).
[27]
Christopher Olston, Noah Fiedel, Kiril Gorovoy, Jeremiah Harmsen, Li Lao, Fang-wei Li, Vinu Rajashekhar, Sukriti Ramesh, and Jordan Soyke. 2017. Tensorflow-serving: Flexible, high-performance ml serving. arXiv:1712.06139(2017).
[28]
Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, Gregory Chanan, Trevor Killeen, Zeming Lin, Natalia Gimelshein, Luca Antiga, et al. 2019. PyTorch: An imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems. 8024--8035.
[29]
Alvin Rajkomar, Eyal Oren, Kai Chen, Andrew M Dai, Nissan Hajaj, Michaela Hardt, Peter J Liu, Xiaobing Liu, Jake Marcus, Mimi Sun, et al.2018. Scalable and accurate deep learning with electronic health records. NPJ Digital Medicine 1, 1(2018), 18.
[30]
Pranav Rajpurkar, Jeremy Irvin, Kaylie Zhu, Brandon Yang, Hershel Mehta, Tony Duan, Daisy Ding, Aarti Bagul, Curtis Langlotz, Katie Shpanskaya, et al. 2017. Chexnet: Radiologist-level pneumonia detection on chest x-rays with deep learning. arXiv preprint arXiv:1711.05225(2017).
[31]
Bobak Shahriari, Kevin Swersky, Ziyu Wang, Ryan P Adams, and Nando De Freitas. 2015. Taking the human out of the loop: A review of Bayesian optimization. Proc. IEEE 104, 1 (2015), 148--175.
[32]
Jasper Snoek, Hugo Larochelle, and Ryan P Adams. 2012. Practical bayesian optimization of machine learning algorithms. In Advances in neural information processing systems. 2951--2959.
[33]
Lu Wang, Wei Zhang, Xiaofeng He, and Hongyuan Zha. 2018. Supervised reinforcement learning with recurrent neural network for dynamic treatment recommendation. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. ACM, 2447--2456.
[34]
Darrell Whitley. 1994. A genetic algorithm tutorial.Statistics and computing 4, 2(1994), 65--85.
[35]
Cao Xiao, Edward Choi, and Jimeng Sun. 2018. Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review. JAMIA25, 10 (2018), 1419--1428.
[36]
Saining Xie, Ross Girshick, Piotr Dollár, Zhuowen Tu, and Kaiming He. 2017. Aggregated residual transformations for deep neural networks. In Proceedings of the IEEE conference on computer vision and pattern recognition. 1492--1500.
[37]
Yanbo Xu, Siddharth Biswal, Shriprasad R Deshpande, Kevin O Maher, and Jimeng Sun. 2018. Raim: Recurrent attentive and intensive model of multimodal patient monitoring data. In KDD. ACM, 2565--2573.
[38]
Zhi-Hua Zhou. 2012. Ensemble methods: foundations and algorithms. Chapman and Hall/CRC.
[39]
Barret Zoph and Quoc V Le. 2016. Neural architecture search with reinforcement learning. arXiv preprint arXiv:1611.01578(2016).

Cited By

View all
  • (2025)A deep learning method for beat-level risk analysis and interpretation of atrial fibrillation patients during sinus rhythmBiomedical Signal Processing and Control10.1016/j.bspc.2024.107028100(107028)Online publication date: Feb-2025
  • (2024)Multibranch Block-Based Grain Size Classification Of Hybrid Disk Using Ultrasonic Scattering: A Deep Learning MethodMaterials Evaluation10.32548/2024.me-0438882:4(38-51)Online publication date: 1-Apr-2024
  • (2024)Screening for severe coronary stenosis in patients with apparently normal electrocardiograms based on deep learningBMC Medical Informatics and Decision Making10.1186/s12911-024-02764-024:1Online publication date: 22-Nov-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '20: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining
August 2020
3664 pages
ISBN:9781450379984
DOI:10.1145/3394486
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 August 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data mining system
  2. health informatics
  3. healthcare
  4. software

Qualifiers

  • Research-article

Funding Sources

  • National Institute of Health award
  • National Science Foundation award

Conference

KDD '20
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)97
  • Downloads (Last 6 weeks)16
Reflects downloads up to 24 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2025)A deep learning method for beat-level risk analysis and interpretation of atrial fibrillation patients during sinus rhythmBiomedical Signal Processing and Control10.1016/j.bspc.2024.107028100(107028)Online publication date: Feb-2025
  • (2024)Multibranch Block-Based Grain Size Classification Of Hybrid Disk Using Ultrasonic Scattering: A Deep Learning MethodMaterials Evaluation10.32548/2024.me-0438882:4(38-51)Online publication date: 1-Apr-2024
  • (2024)Screening for severe coronary stenosis in patients with apparently normal electrocardiograms based on deep learningBMC Medical Informatics and Decision Making10.1186/s12911-024-02764-024:1Online publication date: 22-Nov-2024
  • (2024)NapTune: Efficient Model Tuning for Mood Classification using Previous Night's Sleep Measures along with Wearable Time-seriesProceedings of the 26th International Conference on Multimodal Interaction10.1145/3678957.3685722(204-213)Online publication date: 4-Nov-2024
  • (2024)A Ranking-Based Cross-Entropy Loss for Early Classification of Time SeriesIEEE Transactions on Neural Networks and Learning Systems10.1109/TNNLS.2023.325020335:8(11194-11203)Online publication date: Aug-2024
  • (2024)Alternating Excitation–Inhibition Dendritic Computing for ClassificationIEEE Transactions on Artificial Intelligence10.1109/TAI.2024.34162365:11(5431-5441)Online publication date: Nov-2024
  • (2024)Easy Pruning via Coresets and Structural Re-ParameterizationIEEE Signal Processing Letters10.1109/LSP.2024.341941931(1725-1729)Online publication date: 2024
  • (2024)D2NAS: Efficient Neural Architecture Search With Performance Improvement and Model Size Reduction for Diverse TasksIEEE Access10.1109/ACCESS.2024.343474312(127074-127085)Online publication date: 2024
  • (2024)Deep learning with information fusion and model interpretation for long-term prenatal fetal heart rate datanpj Women's Health10.1038/s44294-024-00033-z2:1Online publication date: 3-Sep-2024
  • (2024)Contrastive voxel clustering for multiscale modeling of brain networkNeuroImage10.1016/j.neuroimage.2024.120755297(120755)Online publication date: Aug-2024
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media