Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3458817.3476142acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article
Public Access

The hidden cost of the edge: a performance comparison of edge and cloud latencies

Published: 13 November 2021 Publication History

Abstract

Edge computing has emerged as a popular paradigm for running latency-sensitive applications due to its ability to offer lower network latencies to end-users. In this paper, we argue that despite its lower network latency, the resource-constrained nature of the edge can result in higher end-to-end latency, especially at higher utilizations, when compared to cloud data centers. We study this edge performance inversion problem through an analytic comparison of edge and cloud latencies and analyze conditions under which the edge can yield worse performance than the cloud. To verify our analytic results, we conduct a detailed experimental comparison of the edge and the cloud latencies using a realistic application and real cloud workloads. Both our analytical and experimental results show that even at moderate utilizations, the edge queuing delays can offset the benefits of lower network latencies, and even result in performance inversion where running in the cloud would provide superior latencies. We finally discuss practical implications of our results and provide insights into how application designers and service providers should design edge applications and systems to avoid these pitfalls.

Supplementary Material

MP4 File (Cloud and Edge Computing - The Hidden Cost of the Edge_ A Performance Comparison of Edge and Cloud Latencies.mp4)
Presentation video

References

[1]
Azure IoT edge. https://azure.microsoft.com/en-us/services/iot-edge/.
[2]
F. Ahmad and T. Vijaykumar. Joint optimization of idle and cooling power in data centers while maintaining response time. In ACM Sigplan Notices, volume 45, pages 243--256. ACM, 2010.
[3]
G. Bolch, S. Greiner, H. De Meer, and K. S. Trivedi. Queueing networks and Markov chains: modeling and performance evaluation with computer science applications. John Wiley & Sons, 2006.
[4]
D. Bruneo. A stochastic model to investigate data center performance and qos in iaas cloud computing systems. IEEE Transactions on Parallel and Distributed Systems, 25(3):560--569, 2014.
[5]
A. Gandhi, S. Doroudi, M. Harchol-Balter, and A. Scheller-Wolf. Exact analysis of the m/m/k/setup class of markov chains via recursive renewal reward. In ACM SIGMETRICS Performance Evaluation Review, volume 41, pages 153--166. ACM, 2013.
[6]
A. Gandhi, V. Gupta, M. Harchol-Balter, and M. A. Kozuch. Optimality analysis of energy-performance trade-off for server farm management. Performance Evaluation, 67(11):1155--1171, 2010.
[7]
M. C. Gonzalez, C. A. Hidalgo, and A.-L. Barabasi. Understanding individual human mobility patterns. nature, 453(7196):779, 2008.
[8]
M. Harchol-Balter. Performance modeling and design of computer systems: queueing theory in action, chapter 14. Cambridge University Press, 2013.
[9]
Y.-J. Hong, J. Xue, and M. Thottethodi. Dynamic server provisioning to minimize cost in an iaas cloud. In Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, pages 147--148. ACM, 2011.
[10]
J. Hsu. How youtube led to google's cloud-gaming service: The tech that made youtube work everywhere promises to do the same for games-[news]. IEEE Spectrum, 56(09):9--10, 2019.
[11]
W. Hu, Y. Gao, K. Ha, J. Wang, B. Amos, Z. Chen, P. Pillai, and M. Satyanarayanan. Quantifying the impact of edge computing on mobile applications. In Proceedings of the 7th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys '16, pages 5:1--5:8, New York, NY, USA, 2016. ACM.
[12]
S. Kelly-Bootle and B. W. Lutek. Chapter 5 - queueing theory. In A. O. Allen, editor, Probability, Statistics, and Queuing Theory with Computer Science Applications (Second Edition), Computer Science and Scientific Computing, pages 247 -- 375. Academic Press, San Diego, second edition edition, 1990.
[13]
A. Khan, X. Yan, S. Tao, and N. Anerousis. Workload characterization and prediction in the cloud: A multiple time series approach. In 2012 IEEE Network Operations and Management Symposium, pages 1287--1294. IEEE, 2012.
[14]
H. Khazaei, J. Misic, and V. B. Misic. Performance analysis of cloud computing centers using m/g/m/m+ r queuing systems. IEEE Transactions on parallel and distributed systems, 23(5):936--943, 2012.
[15]
H. Khazaei, J. Mišić, V. B. Mišić, and S. Rashwand. Analysis of a pool management scheme for cloud computing centers. IEEE Transactions on parallel and distributed systems, 24(5):849--861, 2013.
[16]
J. Kingman. Inequalities in the theory of queues. Journal of the Royal Statistical Society. Series B (Methodological), pages 102--110, 1970.
[17]
L. Kleinrock. Theory, volume 1, queueing systems, 1975.
[18]
C. N. Le Tan, C. Klein, and E. Elmroth. Location-aware load prediction in edge datacenters. In Fog and Mobile Edge Computing (FMEC), 2017 Second International Conference on, pages 25--31. IEEE, 2017.
[19]
J. Li, N. K. Sharma, D. R. Ports, and S. D. Gribble. Tales of the tail: Hardware, os, and application-level sources of tail latency. In Proceedings of the ACM Symposium on Cloud Computing, pages 1--14. ACM, 2014.
[20]
C. Lu, K. Ye, G. Xu, C.-Z. Xu, and T. Bai. Imbalance in the cloud: An analysis on alibaba cluster trace. In 2017 IEEE International Conference on Big Data (Big Data), pages 2884--2892. IEEE, 2017.
[21]
Y. Mao, J. Zhang, S. Song, and K. B. Letaief. Power-delay tradeoff in multi-user mobile-edge computing systems. In IEEE Global Communications Conference (GLOBECOM), pages 1--6. IEEE, 2016.
[22]
E. Nygren, R. K. Sitaraman, and J. Sun. The akamai network: a platform for high-performance internet applications. ACM SIGOPS Operating Systems Review, 44(3):2--19, 2010.
[23]
M. Piorkowski, N. Sarafijanovic-Djukic, and M. Grossglauser. CRAWDAD data set EPFL/mobility. https://crawdad.org/epfl/mobility/20090224/.
[24]
M. H. Rothkopf and P. Rech. Perspectives on queues: Combining queues is not always beneficial. Operations Research, 35(6):906--909, 1987.
[25]
M. Satyanarayanan. Pervasive computing: Vision and challenges. IEEE Personal communications, 8(4):10--17, 2001.
[26]
M. Satyanarayanan. The emergence of edge computing. Computer, 50(1):30--39, 2017.
[27]
M. Satyanarayanan, V. Bahl, R. Caceres, and N. Davies. The case for VM-based cloudlets in mobile computing. IEEE pervasive Computing, 2009.
[28]
M. Shahrad, R. Fonseca, Í. Goiri, G. Chaudhry, P. Batum, J. Cooke, E. Laureano, C. Tresness, M. Russinovich, and R. Bianchini. Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider. In 2020 {USENIX} Annual Technical Conference ({USENIX}{ATC} 20), pages 205--218, 2020.
[29]
L. Suresh, M. Canini, S. Schmid, and A. Feldmann. C3: Cutting tail latency in cloud data stores via adaptive replica selection. In 12th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 15), pages 513--527, 2015.
[30]
L. Tong, Y. Li, and W. Gao. A hierarchical edge cloud architecture for mobile computing. In INFOCOM 2016-The 35th Annual IEEE International Conference on Computer Communications, IEEE, pages 1--9. IEEE, 2016.
[31]
J. N. Tsitsiklis and K. Xu. On the power of (even a little) centralization in distributed processing. ACM SIGMETRICS Performance Evaluation Review, 39(1):121--132, 2011.
[32]
J. N. Tsitsiklis and K. Xu. On the power of (even a little) resource pooling. Stochastic Systems, 2(1):1--66, 2012.
[33]
R. Urgaonkar, S. Wang, T. He, M. Zafer, K. Chan, and K. K. Leung. Dynamic service migration and workload scheduling in edge-clouds. Performance Evaluation, 91:205--228, 2015.
[34]
M. Weiser. The computer for the 21st century. Scientific american, 265(3):94--104, 1991.
[35]
W. Whitt. Understanding the efficiency of multi-server service systems. Management Science, 38(5):708--723, 1992.
[36]
W. Whitt. "approximations for the gi/g/m queue". Production and Operations Management, 2(2):114--161, 1993.
[37]
J. Xue, R. Birke, L. Y. Chen, and E. Smirni. Spatial-temporal prediction models for active ticket managing in data centers. IEEE Transactions on Network and Service Management, 15(1):39--52, 2018.

Cited By

View all
  • (2024)Computing-aware network (CAN): a systematic design of computing and network convergence算力感知网络:一种算网一体的系统设计Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.240009825:5(633-644)Online publication date: 7-Jun-2024
  • (2024)INVAR: Inversion Aware Resource Provisioning and Workload Scheduling for Edge ComputingIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621417(1511-1520)Online publication date: 20-May-2024
  • (2023)Edge Computing Research – A ReviewJournal of Information Technology and Digital World10.36548/jitdw.2023.1.0055:1(62-74)Online publication date: Mar-2023
  • Show More Cited By

Index Terms

  1. The hidden cost of the edge: a performance comparison of edge and cloud latencies
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    SC '21: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
    November 2021
    1493 pages
    ISBN:9781450384421
    DOI:10.1145/3458817
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Sponsors

    In-Cooperation

    • IEEE CS

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 13 November 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Badges

    Qualifiers

    • Research-article

    Funding Sources

    Conference

    SC '21
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)350
    • Downloads (Last 6 weeks)37
    Reflects downloads up to 21 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Computing-aware network (CAN): a systematic design of computing and network convergence算力感知网络:一种算网一体的系统设计Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.240009825:5(633-644)Online publication date: 7-Jun-2024
    • (2024)INVAR: Inversion Aware Resource Provisioning and Workload Scheduling for Edge ComputingIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621417(1511-1520)Online publication date: 20-May-2024
    • (2023)Edge Computing Research – A ReviewJournal of Information Technology and Digital World10.36548/jitdw.2023.1.0055:1(62-74)Online publication date: Mar-2023
    • (2023)Better Orchestration for SLO-Oriented Cross-site Microservices in Multi-tenant Cloud/Edge ContinuumProceedings of the 24th International Middleware Conference: Demos, Posters and Doctoral Symposium10.1145/3626564.3629091(9-10)Online publication date: 11-Dec-2023
    • (2023)Service Caching and Computation Reuse Strategies at the Edge: A SurveyACM Computing Surveys10.1145/360950456:2(1-38)Online publication date: 14-Sep-2023
    • (2023)RAVAS: Interference-Aware Model Selection and Resource Allocation for Live Edge Video AnalyticsProceedings of the Eighth ACM/IEEE Symposium on Edge Computing10.1145/3583740.3628443(27-39)Online publication date: 6-Dec-2023
    • (2023)Cost Optimization for Cloud Storage from User Perspectives: Recent Advances, Taxonomy, and SurveyACM Computing Surveys10.1145/358288355:13s(1-37)Online publication date: 13-Jul-2023
    • (2023)Graph3PO: A Temporal Graph Data Processing Method for Latency QoS Guarantee in Object Cloud Storage SystemProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607075(1-16)Online publication date: 12-Nov-2023
    • (2023)EdgeAdaptor: Online Configuration Adaption, Model Selection and Resource Provisioning for Edge DNN Inference Serving at ScaleIEEE Transactions on Mobile Computing10.1109/TMC.2022.318918622:10(5870-5886)Online publication date: 1-Oct-2023
    • (2023)Unlocking Efficiency: Understanding End-to-End Performance in Distributed Analytics PipelinesMILCOM 2023 - 2023 IEEE Military Communications Conference (MILCOM)10.1109/MILCOM58377.2023.10356378(150-155)Online publication date: 30-Oct-2023
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media