Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3458817.3476142acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
research-article
Public Access

The hidden cost of the edge: a performance comparison of edge and cloud latencies

Published: 13 November 2021 Publication History

Abstract

Edge computing has emerged as a popular paradigm for running latency-sensitive applications due to its ability to offer lower network latencies to end-users. In this paper, we argue that despite its lower network latency, the resource-constrained nature of the edge can result in higher end-to-end latency, especially at higher utilizations, when compared to cloud data centers. We study this edge performance inversion problem through an analytic comparison of edge and cloud latencies and analyze conditions under which the edge can yield worse performance than the cloud. To verify our analytic results, we conduct a detailed experimental comparison of the edge and the cloud latencies using a realistic application and real cloud workloads. Both our analytical and experimental results show that even at moderate utilizations, the edge queuing delays can offset the benefits of lower network latencies, and even result in performance inversion where running in the cloud would provide superior latencies. We finally discuss practical implications of our results and provide insights into how application designers and service providers should design edge applications and systems to avoid these pitfalls.

Supplementary Material

MP4 File (Cloud and Edge Computing - The Hidden Cost of the Edge_ A Performance Comparison of Edge and Cloud Latencies.mp4)
Presentation video

References

[1]
Azure IoT edge. https://azure.microsoft.com/en-us/services/iot-edge/.
[2]
F. Ahmad and T. Vijaykumar. Joint optimization of idle and cooling power in data centers while maintaining response time. In ACM Sigplan Notices, volume 45, pages 243--256. ACM, 2010.
[3]
G. Bolch, S. Greiner, H. De Meer, and K. S. Trivedi. Queueing networks and Markov chains: modeling and performance evaluation with computer science applications. John Wiley & Sons, 2006.
[4]
D. Bruneo. A stochastic model to investigate data center performance and qos in iaas cloud computing systems. IEEE Transactions on Parallel and Distributed Systems, 25(3):560--569, 2014.
[5]
A. Gandhi, S. Doroudi, M. Harchol-Balter, and A. Scheller-Wolf. Exact analysis of the m/m/k/setup class of markov chains via recursive renewal reward. In ACM SIGMETRICS Performance Evaluation Review, volume 41, pages 153--166. ACM, 2013.
[6]
A. Gandhi, V. Gupta, M. Harchol-Balter, and M. A. Kozuch. Optimality analysis of energy-performance trade-off for server farm management. Performance Evaluation, 67(11):1155--1171, 2010.
[7]
M. C. Gonzalez, C. A. Hidalgo, and A.-L. Barabasi. Understanding individual human mobility patterns. nature, 453(7196):779, 2008.
[8]
M. Harchol-Balter. Performance modeling and design of computer systems: queueing theory in action, chapter 14. Cambridge University Press, 2013.
[9]
Y.-J. Hong, J. Xue, and M. Thottethodi. Dynamic server provisioning to minimize cost in an iaas cloud. In Proceedings of the ACM SIGMETRICS joint international conference on Measurement and modeling of computer systems, pages 147--148. ACM, 2011.
[10]
J. Hsu. How youtube led to google's cloud-gaming service: The tech that made youtube work everywhere promises to do the same for games-[news]. IEEE Spectrum, 56(09):9--10, 2019.
[11]
W. Hu, Y. Gao, K. Ha, J. Wang, B. Amos, Z. Chen, P. Pillai, and M. Satyanarayanan. Quantifying the impact of edge computing on mobile applications. In Proceedings of the 7th ACM SIGOPS Asia-Pacific Workshop on Systems, APSys '16, pages 5:1--5:8, New York, NY, USA, 2016. ACM.
[12]
S. Kelly-Bootle and B. W. Lutek. Chapter 5 - queueing theory. In A. O. Allen, editor, Probability, Statistics, and Queuing Theory with Computer Science Applications (Second Edition), Computer Science and Scientific Computing, pages 247 -- 375. Academic Press, San Diego, second edition edition, 1990.
[13]
A. Khan, X. Yan, S. Tao, and N. Anerousis. Workload characterization and prediction in the cloud: A multiple time series approach. In 2012 IEEE Network Operations and Management Symposium, pages 1287--1294. IEEE, 2012.
[14]
H. Khazaei, J. Misic, and V. B. Misic. Performance analysis of cloud computing centers using m/g/m/m+ r queuing systems. IEEE Transactions on parallel and distributed systems, 23(5):936--943, 2012.
[15]
H. Khazaei, J. Mišić, V. B. Mišić, and S. Rashwand. Analysis of a pool management scheme for cloud computing centers. IEEE Transactions on parallel and distributed systems, 24(5):849--861, 2013.
[16]
J. Kingman. Inequalities in the theory of queues. Journal of the Royal Statistical Society. Series B (Methodological), pages 102--110, 1970.
[17]
L. Kleinrock. Theory, volume 1, queueing systems, 1975.
[18]
C. N. Le Tan, C. Klein, and E. Elmroth. Location-aware load prediction in edge datacenters. In Fog and Mobile Edge Computing (FMEC), 2017 Second International Conference on, pages 25--31. IEEE, 2017.
[19]
J. Li, N. K. Sharma, D. R. Ports, and S. D. Gribble. Tales of the tail: Hardware, os, and application-level sources of tail latency. In Proceedings of the ACM Symposium on Cloud Computing, pages 1--14. ACM, 2014.
[20]
C. Lu, K. Ye, G. Xu, C.-Z. Xu, and T. Bai. Imbalance in the cloud: An analysis on alibaba cluster trace. In 2017 IEEE International Conference on Big Data (Big Data), pages 2884--2892. IEEE, 2017.
[21]
Y. Mao, J. Zhang, S. Song, and K. B. Letaief. Power-delay tradeoff in multi-user mobile-edge computing systems. In IEEE Global Communications Conference (GLOBECOM), pages 1--6. IEEE, 2016.
[22]
E. Nygren, R. K. Sitaraman, and J. Sun. The akamai network: a platform for high-performance internet applications. ACM SIGOPS Operating Systems Review, 44(3):2--19, 2010.
[23]
M. Piorkowski, N. Sarafijanovic-Djukic, and M. Grossglauser. CRAWDAD data set EPFL/mobility. https://crawdad.org/epfl/mobility/20090224/.
[24]
M. H. Rothkopf and P. Rech. Perspectives on queues: Combining queues is not always beneficial. Operations Research, 35(6):906--909, 1987.
[25]
M. Satyanarayanan. Pervasive computing: Vision and challenges. IEEE Personal communications, 8(4):10--17, 2001.
[26]
M. Satyanarayanan. The emergence of edge computing. Computer, 50(1):30--39, 2017.
[27]
M. Satyanarayanan, V. Bahl, R. Caceres, and N. Davies. The case for VM-based cloudlets in mobile computing. IEEE pervasive Computing, 2009.
[28]
M. Shahrad, R. Fonseca, Í. Goiri, G. Chaudhry, P. Batum, J. Cooke, E. Laureano, C. Tresness, M. Russinovich, and R. Bianchini. Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider. In 2020 {USENIX} Annual Technical Conference ({USENIX}{ATC} 20), pages 205--218, 2020.
[29]
L. Suresh, M. Canini, S. Schmid, and A. Feldmann. C3: Cutting tail latency in cloud data stores via adaptive replica selection. In 12th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 15), pages 513--527, 2015.
[30]
L. Tong, Y. Li, and W. Gao. A hierarchical edge cloud architecture for mobile computing. In INFOCOM 2016-The 35th Annual IEEE International Conference on Computer Communications, IEEE, pages 1--9. IEEE, 2016.
[31]
J. N. Tsitsiklis and K. Xu. On the power of (even a little) centralization in distributed processing. ACM SIGMETRICS Performance Evaluation Review, 39(1):121--132, 2011.
[32]
J. N. Tsitsiklis and K. Xu. On the power of (even a little) resource pooling. Stochastic Systems, 2(1):1--66, 2012.
[33]
R. Urgaonkar, S. Wang, T. He, M. Zafer, K. Chan, and K. K. Leung. Dynamic service migration and workload scheduling in edge-clouds. Performance Evaluation, 91:205--228, 2015.
[34]
M. Weiser. The computer for the 21st century. Scientific american, 265(3):94--104, 1991.
[35]
W. Whitt. Understanding the efficiency of multi-server service systems. Management Science, 38(5):708--723, 1992.
[36]
W. Whitt. "approximations for the gi/g/m queue". Production and Operations Management, 2(2):114--161, 1993.
[37]
J. Xue, R. Birke, L. Y. Chen, and E. Smirni. Spatial-temporal prediction models for active ticket managing in data centers. IEEE Transactions on Network and Service Management, 15(1):39--52, 2018.

Cited By

View all
  • (2025)Real-Time Data Processing in Agricultural RoboticsComputer Vision Techniques for Agricultural Advancements10.4018/979-8-3693-8019-2.ch014(431-468)Online publication date: 3-Jan-2025
  • (2025)DarwinGame: Playing Tournaments for Tuning Applications in Noisy Cloud EnvironmentsProceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3669940.3707259(264-279)Online publication date: 30-Mar-2025
  • (2025)Application Adaptive Light-Weight Deep Learning (AppAdapt-LWDL) Framework for Enabling Edge Intelligence in Dairy ProcessingIEEE Transactions on Mobile Computing10.1109/TMC.2024.347563424:2(1105-1119)Online publication date: 1-Feb-2025
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SC '21: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis
November 2021
1493 pages
ISBN:9781450384421
DOI:10.1145/3458817
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

In-Cooperation

  • IEEE CS

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 November 2021

Permissions

Request permissions for this article.

Check for updates

Badges

Qualifiers

  • Research-article

Funding Sources

Conference

SC '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)346
  • Downloads (Last 6 weeks)56
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2025)Real-Time Data Processing in Agricultural RoboticsComputer Vision Techniques for Agricultural Advancements10.4018/979-8-3693-8019-2.ch014(431-468)Online publication date: 3-Jan-2025
  • (2025)DarwinGame: Playing Tournaments for Tuning Applications in Noisy Cloud EnvironmentsProceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 110.1145/3669940.3707259(264-279)Online publication date: 30-Mar-2025
  • (2025)Application Adaptive Light-Weight Deep Learning (AppAdapt-LWDL) Framework for Enabling Edge Intelligence in Dairy ProcessingIEEE Transactions on Mobile Computing10.1109/TMC.2024.347563424:2(1105-1119)Online publication date: 1-Feb-2025
  • (2025)A Lightweight Knowledge Distillation and Feature Compression Model for User Click-Through Rates Prediction in Edge Computing ScenariosIEEE Internet of Things Journal10.1109/JIOT.2024.344664012:3(2295-2308)Online publication date: 1-Feb-2025
  • (2024)Computing-aware network (CAN): a systematic design of computing and network convergence算力感知网络:一种算网一体的系统设计Frontiers of Information Technology & Electronic Engineering10.1631/FITEE.240009825:5(633-644)Online publication date: 7-Jun-2024
  • (2024)Cost-Aware Dispersed Resource Probing and Offloading At the Edge: A User-Centric Online Layered Learning ApproachIEEE Transactions on Services Computing10.1109/TSC.2024.3489435(1-16)Online publication date: 2024
  • (2024)Colibri: Efficient Collection of Fine-Grained Resource Metrics Necessary for Mobile Edge Computing2024 IEEE/ACM Symposium on Edge Computing (SEC)10.1109/SEC62691.2024.00011(29-44)Online publication date: 4-Dec-2024
  • (2024)INVAR: Inversion Aware Resource Provisioning and Workload Scheduling for Edge ComputingIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621417(1511-1520)Online publication date: 20-May-2024
  • (2023)Edge Computing Research – A ReviewJournal of Information Technology and Digital World10.36548/jitdw.2023.1.0055:1(62-74)Online publication date: Mar-2023
  • (2023)Better Orchestration for SLO-Oriented Cross-site Microservices in Multi-tenant Cloud/Edge ContinuumProceedings of the 24th International Middleware Conference: Demos, Posters and Doctoral Symposium10.1145/3626564.3629091(9-10)Online publication date: 11-Dec-2023
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media