Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

POBO: : Safe and optimal resource management for cloud microservices

Published: 01 November 2023 Publication History

Abstract

Resource management in microservices is challenging due to the uncertain latency–resource relationship, dynamic environment, and strict Service-Level Agreement (SLA) guarantees. This paper presents a Pessimistic and Optimistic Bayesian Optimization framework, named POBO, for safe and optimal resource configuration for microservice applications. POBO leverages Bayesian learning to estimate the uncertain latency–resource functions and combines primal–dual and penalty-based optimization to maximize resource efficiency while guaranteeing strict SLAs. We prove that POBO can achieve sublinear regret and SLA violation against the optimal resource configuration in hindsight. We have implemented a prototype of POBO and conducted extensive experiments on a real-world microservice application. Our results show that POBO can find the safe and optimal configuration efficiently, outperforming Kubernetes’ built-in auto-scaling module and the state-of-the-art algorithms.

References

[1]
Alshuqayran N., Ali N., Evans R., A systematic mapping study in microservice architecture, in: 2016 IEEE 9th International Conference on Service-Oriented Computing and Applications, SOCA, IEEE, 2016, pp. 44–51.
[2]
[3]
Amazon web services, 2023, URL https://aws.amazon.com.
[4]
[5]
J. Park, B. Choi, C. Lee, D. Han, GRAF: A graph neural network based proactive resource allocation framework for SLO-oriented microservices, in: Proceedings of the 17th International Conference on Emerging Networking EXperiments and Technologies, 2021, pp. 154–167.
[6]
Y. Zhang, W. Hua, Z. Zhou, G.E. Suh, C. Delimitrou, Sinan: ML-based and QoS-aware resource management for cloud microservices, in: Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021, pp. 167–181.
[7]
S. Luo, H. Xu, K. Ye, G. Xu, L. Zhang, J. He, G. Yang, C. Xu, Erms: Efficient Resource Management for Shared Microservices with SLA Guarantees, in: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, 2022, pp. 62–77.
[8]
Li Q., Li B., Mercati P., Illikkal R., Tai C., Kishinevsky M., Kozyrakis C., RAMBO: Resource allocation for microservices using Bayesian optimization, IEEE Comput. Archit. Lett. 20 (1) (2021) 46–49.
[9]
Patel T., Tiwari D., Clite: Efficient and qos-aware co-location of multiple latency-critical jobs for warehouse scale computers, in: 2020 IEEE International Symposium on High Performance Computer Architecture, HPCA, IEEE, 2020, pp. 193–206.
[10]
Roy R.B., Patel T., Tiwari D., Satori: efficient and fair resource partitioning by sacrificing short-term benefits for long-term gains, in: 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture, ISCA, IEEE, 2021, pp. 292–305.
[11]
Y. Liu, H. Xu, W.C. Lau, Online Resource Optimization for Elastic Stream Processing with Regret Guarantee, in: Proceedings of the 51st International Conference on Parallel Processing, 2022, pp. 1–11.
[12]
Z. Zhou, Y. Zhang, C. Delimitrou, AQUATOPE: QoS-and-Uncertainty-Aware Resource Management for Multi-stage Serverless Workflows, in: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, 2022, pp. 1–14.
[13]
X. Zhang, H. Wu, Y. Li, J. Tan, F. Li, B. Cui, Towards dynamic and safe configuration tuning for cloud databases, in: Proceedings of the 2022 International Conference on Management of Data, 2022, pp. 631–645.
[14]
X. Zhou, B. Ji, On Kernelized Multi-Armed Bandits with Constraints, in: Thirty-Sixth Conference on Neural Information Processing Systems, 2022.
[15]
Guo H., Zhu Q., Liu X., Rectified pessimistic-optimistic learning for stochastic continuum-armed bandit with constraints, 2022, arXiv preprint arXiv:2211.14720.
[16]
Y. Gan, Y. Zhang, D. Cheng, A. Shetty, P. Rathi, N. Katarki, A. Bruno, J. Hu, B. Ritchken, B. Jackson, et al., An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems, in: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, 2019, pp. 3–18.
[17]
Chowdhury S.R., Gopalan A., On kernelized multi-armed bandits, in: International Conference on Machine Learning, PMLR, 2017, pp. 844–853.
[19]
Liu X., Li B., Shi P., Ying L., An efficient pessimistic-optimistic algorithm for stochastic linear bandits with general constraints, Adv. Neural Inf. Process. Syst. 34 (2021) 24075–24086.
[20]
Neely M.J., Energy-aware wireless scheduling with near-optimal backlog and convergence time tradeoffs, IEEE/ACM Trans. Netw. 24 (4) (2015) 2223–2236.
[22]
[23]
[24]
Gan Y., Zhang Y., Cheng D., Shetty A., Rathi P., Katarki N., Bruno A., Hu J., Ritchken B., Jackson B., Hu K., Pancholi M., He Y., Clancy B., Colen C., Wen F., Leung C., Wang S., Zaruvinsky L., Espinosa M., Lin R., Liu Z., Padilla J., Delimitrou C., An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems, in: Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, ACM, 2019, pp. 3–18.
[27]
Q. Weng, W. Xiao, Y. Yu, W. Wang, C. Wang, J. He, Y. Li, L. Zhang, W. Lin, Y. Ding, MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters, in: 19th USENIX Symposium on Networked Systems Design and Implementation, NSDI 2022, Renton, WA, USA, April 4-6, 2022, 2022, pp. 945–960.
[28]
N. Srinivas, A. Krause, S.M. Kakade, M. Seeger, Gaussian process optimization in the bandit setting: No regret and experimental design, in: Proceedings of the International Conference on Machine Learning, 2010, 2010.
[29]
Hajek B., Hitting-time and occupation-time bounds implied by drift analysis with applications, Adv. Appl. Probab. 14 (3) (1982) 502–525.

Index Terms

  1. POBO: Safe and optimal resource management for cloud microservices
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Please enable JavaScript to view thecomments powered by Disqus.

          Information & Contributors

          Information

          Published In

          cover image Performance Evaluation
          Performance Evaluation  Volume 162, Issue C
          Nov 2023
          414 pages

          Publisher

          Elsevier Science Publishers B. V.

          Netherlands

          Publication History

          Published: 01 November 2023

          Author Tags

          1. Microservice resource management
          2. Service-Level Agreement
          3. Safe Bayesian optimization
          4. Primal–dual optimization
          5. Penalty-based design

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 0
            Total Downloads
          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 13 Feb 2025

          Other Metrics

          Citations

          View Options

          View options

          Figures

          Tables

          Media

          Share

          Share

          Share this Publication link

          Share on social media