Abstract
Accurate prediction of cloud resource instances is becoming increasingly important for public cloud users and cloud service providers, because it touches on the reasonable reservation of cloud resources with minimize costs. However, current methods do not predict the instance types of cloud resources based on the application workloads from users, and less consider the characteristics of workload data changes in the real-time prediction. To solve these problems, this paper proposes an application workload-dependent cloud resource instance prediction model to predict appropriate cloud instance resource usage in a timely manner. Firstly, we adopt a trend degree (TD) to classify all requested workloads into three types of wave trend patterns. Next, a Hidden Markov model based cloud resource prediction method (HMM-CPM) tracing the requested workload trends is presented. Finally, the reasonable cloud instance types following the patterns of the requested workloads can be predicted. The simulation results show that the proposed method can predict cloud resource instance types in the scenario with certain workload fluctuation, and the prediction accuracy is higher than the existing related approaches.
Similar content being viewed by others
Data Availability
Enquiries about data availability should be directed to the authors.
References
Kashyap, S., Singh, A.: Prediction-based scheduling techniques for cloud data center’s workload: a systematic review. Clust. Comput. 26(5), 3209–3235 (2023)
Wang, X., Cao, J., Xiang, Y.: Dynamic cloud service selection using an adaptive learning mechanism in multi-cloud computing. J. Syst. Softw. 100, 195–210 (2015)
Reiss, C., Tumanov, A., Ganger, G.R., Katz, R.H., Kozuch, M.A.: Heterogeneity and dynamicity of clouds at scale: Google trace analysis. In: Acm Symposium on Cloud Computing (2012)
Chen, Z., Hu, J., Min, G., Zomaya, A.Y., El-Ghazawi, T.: Towards accurate prediction for high-dimensional and highly-variable cloud workloads with deep learning. IEEE Trans. Parallel Distrib. Syst. 31(4), 923–934 (2020)
Wang, X., Cao, J., Yang, D., Qin, Z., Buyya, R.: Online cloud resource prediction via scalable window waveform sampling on classified workloads. Futur. Gener. Comput. Syst. 117, 338–358 (2021)
Yadav, M.P., Rohit, Yadav, D.K.: Resource provisioning through machine learning in cloud services. Arabtan J. Sci. Eng. 47(2), 1483–1505 (2022)
Wei, L., Foh, C.H., He, B., Cai, J.: Towards efficient resource allocation for heterogeneous workloads in iaas clouds. IEEE Trans. Cloud Comput. 6(1), 264–275 (2018)
Wei, L., He, B., Foh, C.H.: Towards multi-resource physical machine provisioning for iaas clouds. In: Jamalipour, A., Deng, D. (eds.) 2014 IEEE International Conference on Communications (ICC), pp. 3469–3474 (2014)
Zhu, Y., Zhang, W., Chen, Y., Gao, H.: A novel approach to workload prediction using attention-based lstm encoder-decoder network in cloud environment. Eurasip J. Wirel. Commun. Netw. 2019(1), 274 (2019)
Sampaio, A.M., Barbosa, J.G., Prodan, R.: Piasa: a power and interference aware resource management strategy for heterogeneous workloads in cloud data centers. Simul. Model. Pract. Theory 57, 142–160 (2015)
Wang, Y., Yu, L., Teng, F., Song, J., Yuan, Y.: Resource load forecasting model based on long-short-term sequence feature fusion. J. Comput. Appl. 42(5), 1508–1515 (2022)
Hu, Y., Deng, B., Peng, F., Wang, D.: Workload prediction for cloud computing elasticity mechanism. In: 2016 IEEE International Conference on Cloud Computing and Big Data Analysis (ICCCBDA 2016), pp. 244–249 (2016)
Kumar, J., Singh, A.K.: Workload prediction in cloud using artificial neural network and adaptive differential evolution. Futur. Gener. Comput. Syst. 81, 41–52 (2018)
Khan, A., Yan, X., Tao, S., Anerousis, N.: Workload characterization and prediction in the cloud: a multiple time series approach. In: DeTurck, F., Gaspary, L., Medhi, D. (eds.) 2012 IEEE Network Operations and Management Symposium (NOMS), pp. 1287–1294 (2012)
Wang, H., Panneerselvam, J., Liu, L., Lu, Y., Zhai, X., Ali, H.: Cloud workload analytics for real-time prediction of user request patterns. In: 2018 IEEE 20th International Conference on High Performance Computing and Communications (HPCC), pp. 1677–1684 (2018)
Duggan, M., Mason, K., Duggan, J., Howley, E., Barrett, E.: Predicting host cpu utilization in cloud computing using recurrent neural networks. In: 2017 IEEE 12th International Conference for Internet Technology and Secured Transactions (ICITST), pp. 67–72 (2017)
Huang, Z., Peng, J., Lian, H., Guo, J., Qiu, W.: Deep recurrent model for server load and performance prediction in data center. Complexity (2017). https://doi.org/10.1155/2017/8584252
Xia, B., Li, T., Zhou, Q., Li, Q., Zhang, H.: An effective classification-based framework for predicting cloud capacity demand in cloud services. IEEE Trans. Serv. Comput. 14(4), 944–956 (2021)
Wang, X., Cao, J., Buyya, R.: Adaptive cloud bundle provisioning and multi-workflow scheduling via coalition reinforcement learning. IEEE Trans. Comput. 72(4), 1041–1054 (2023)
Hwang, R.-H., Lee, C.-N., Chen, Y.-R., Zhang-Jian, D.-J.: Cost optimization of elasticity cloud resource subscription policy. IEEE Trans. Serv. Comput. 7(4), 561–574 (2014)
Gupta, S., Dileep, A.D., Gonsalves, T.A.: A joint feature selection framework for multivariate resource usage prediction in cloud servers using stability and prediction performance. J. Supercomput. 74(11, SI), 6033–6068 (2018)
Pinciroli, R., Ali, A., Yan, F., Smirni, E.: Cedule+: Resource management for burstable cloud instances using predictive analytics. IEEE Trans. Netw. Serv. Manage. 18(1), 945–957 (2021)
Gupta, S., Dileep, A.D., Gonsalves, T.A.: Online sparse blstm models for resource usage prediction in cloud datacentres. IEEE Trans. Netw. Serv. Manage. 17(4), 2335–2349 (2020)
Nawrocki, P., Osypanka, P., Posluszny, B.: Data-driven adaptive prediction of cloud resource usage. J. Grid Comput. 21(1), 6 (2023)
Wang, S., Yao, Y., Xiao, Y., Chen, H.: Dynamic resource prediction in cloud computing for complex system simulatiuon: a probabilistic approach using stacking ensemble learning. In: 2020 International Conference on Intelligent Computing and Human-Computer Interaction (ICHCI), pp. 198–201 (2020)
Shen, H., Chen, L.: A resource-efficient predictive resource provisioning system in cloud systems. IEEE Trans. Parallel Distrib. Syst. 33(12), 3886–3900 (2022)
Rossi, A., Visentin, A., Prestwich, S., Brown, K.N.: Bayesian uncertainty modelling for cloud workload prediction. In: 2022 IEEE 15th International Conference on Cloud Computing (CLOUD), pp. 19–29 (2022)
Bao, L., Yang, J., Zhang, Z., Liu, W., Chen, J., Wu, C.: On accurate prediction of cloud workloads with adaptive pattern mining. J. Supercomput. 79(1), 160–187 (2023)
Liu, S., Jiang, C.: A novel prediction approach based on three-way decision for cloud datacenters. Appl. Intell. 53(17), 20239–20255 (2023)
Amazon: Amazon ec2 on-demand pricing. https://aws.amazon.com/ec2/pricing/on-demand (2023)
Wilkes, J.: Yet more Google compute cluster trace data. Google research blog, Mountain View, CA, USA. https://ai.googleblog.com/2020/04/yet-more-google-compute-cluster-trace.html (2020)
LinearModels: Scikit-learn. https://scikit-learn.org/stable/modules/linear_model.html (2020)
SupportVectorRegression: Scikit-learn. https://scikit-learn.org/stable/modules/svm.html (2020)
Baig, S.U.R., Iqbal, W., Berral, J.L., Erradi, A., Carrera, D.: Adaptive prediction models for data center resources utilization estimation. IEEE Trans. Netw. Serv. Manage. 16(4), 1681–1693 (2019)
Acknowledgements
This work is supported in part by the Natural Science Foundation of Shanghai Science and Technology Innovation Action Plan of China under Grant 22ZR1425300, the National Natural Science Foundation of China under Grant 61963017, the Shanghai Educational Science Research Project of China under Grant C2022056, the Shanghai Science and Technology Program of China under Grant 23010501000, and the Humanities and Social Sciences of Ministry of Education Planning Fund of China under Grant 22YJAZHA145.
Funding
The authors have not disclosed any funding.
Author information
Authors and Affiliations
Contributions
The authors confirm their contribution to the paper as follows: Yang Z.: Conceptualization, methodology, formal analysis, writing - original draft. Wang X.: Supervision, method guidance, writing - review and editing. Li R.: Algorithmic programming, data curation, validation. Liu Y.: Project coordination.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
The interpretation on the content extension
The preliminary work of this paper has been published at 2023 9th IEEE International Conference on Cloud Computing and Intelligence Systems (CCIS2023) under the title “Cloud Instance Resources Prediction Based on Hidden Markov Model”, and we have substantially modified and expanded the preliminary version of this paper about the model, algorithms, examples, experimental content and references for forming this journal standard version.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Yang, Z., Wang, X., Li, R. et al. HMM-CPM: a cloud instance resource prediction method tracing the workload trends via hidden Markov model. Cluster Comput 27, 11823–11838 (2024). https://doi.org/10.1007/s10586-024-04580-7
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-024-04580-7