Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3431379.3460646acmconferencesArticle/Chapter ViewAbstractPublication PageshpdcConference Proceedingsconference-collections
research-article
Public Access

LaSS: Running Latency Sensitive Serverless Computations at the Edge

Published: 21 June 2021 Publication History

Abstract

Serverless computing has emerged as a new paradigm for running short-lived computations in the cloud. Due to its ability to handle IoT workloads, there has been considerable interest in running serverless functions at the edge. However, the constrained nature of the edge and the latency sensitive nature of workloads result in many challenges for serverless platforms. In this paper, we present LaSS, a platform that uses model-driven approaches for running latency-sensitive serverless computations on edge resources. LaSS uses principled queuing-based methods to determine an appropriate allocation for each hosted function and auto-scales the allocated resources in response to workload dynamics. LaSS uses a fair-share allocation approach to guarantee a minimum of allocated resources to each function in the presence of overload. In addition, it utilizes resource reclamation methods based on container deflation and termination to reassign resources from over-provisioned functions to under-provisioned ones. We implement a prototype of our approach on an OpenWhisk serverless edge cluster and conduct a detailed experimental evaluation. Our results show that LaSS can accurately predict the resources needed for serverless functions in the presence of highly dynamic workloads, and reprovision container capacity within hundreds of milliseconds while maintaining fair share allocation guarantees.

References

[1]
2019 (accessed June 30, 2020). AWS Lambda enables functions that can run up to 15 minutes. https://aws.amazon.com/about-aws/whats-new/2018/10/aws-lambda-supports-functions-that-can-run-up-to-15-minutes/.
[2]
accessed Janurary 23, 2021. Apache OpenWhisk. https://openwhisk.apache.org/.
[3]
accessed Janurary 23, 2021. AWS Lambda now supports container images as a packaging format. https://aws.amazon.com/about-aws/whats-new/2020/12/aws-lambda-now-supports-container-images-as-a-packaging-format/.
[4]
accessed Janurary 23, 2021. Azure Functions custom handlers. https://docs. microsoft.com/en-us/azure/azure-functions/functions-custom-handlers.
[5]
accessed Janurary 23, 2021. BinaryAlert. https://binaryalert.io/.
[6]
accessed Janurary 23, 2021. Knative. https://knative.dev.
[7]
accessed Janurary 23, 2021. OpenWhisk Blackbox Actions. https://github.com/apache/openwhisk-runtime-docker/blob/dockerskeleton%401.14.0/sdk/docker/README.md.
[8]
accessed Janurary 23, 2021. torchvision. https://github.com/pytorch/vision.
[9]
accessed June 30, 2020. AWS Lambda. https://aws.amazon.com/lambda/.
[10]
accessed June 30, 2020. Azure Functions. https://azure.microsoft.com/en-us/ services/functions/.
[11]
accessed June 30, 2020. Cloud Functions. https://cloud.google.com/functions.
[12]
accessed June 30, 2020. Kubeless: The Kubernetes Native Serverless Framework. https://kubeless.io/.
[13]
Gojko Adzic and Robert Chatley. 2017. Serverless Computing: Economic and Architectural Impact. In Proceedings of the ACM ESEC/FSE '17. 884--889.
[14]
Alexandru Agache, Marc Brooker, Alexandra Iordache, Anthony Liguori, Rolf Neugebauer, Phil Piwonka, and Diana-Maria Popa. 2020. Firecracker: Lightweight Virtualization for Serverless Applications. In NSDI '20. 419--434.
[15]
Istemi Ekin Akkus et almbox. 2018. {SAND}: Towards High-Performance Serverless Computing. In USENIX ATC '18. 923--935.
[16]
FSQ Alves et almbox. 2011. Upper Bounds on Performance Measures of Heterogeneous $M/M/c$ Queues. Mathematical Problems in Engineering, Vol. 2011 (2011).
[17]
Ganesh Ananthanarayanan et almbox. 2017. Real-time video analytics: The killer app for edge computing. computer, Vol. 50, 10 (2017), 58--67.
[18]
Lixiang Ao et almbox. 2018. Sprocket: A serverless video processing framework. In Proceedings of the ACM Symposium on Cloud Computing. 263--274.
[19]
Austin Aske and Xinghui Zhao. 2018. Supporting multi-provider serverless computing on the edge. In Proceedings of ACM ICPP '18. 1--6.
[20]
Ioana Baldini et almbox. 2017. The serverless trilemma: Function composition for serverless computing. In Proceedings of ACM SIGPLAN Onward! 2017. 89--103.
[21]
Nikhil Bansal and Mor Harchol-Balter. 2001. Analysis of SRPT scheduling: Investigating unfairness. In ACM SIGMETRICS '01. 279--290.
[22]
Paul Castro et almbox. 2017. Serverless programming (function as a service). In IEEE ICDCS '17. 2658--2659.
[23]
Ryan Chard, Yadu Babuji, Zhuozhao Li, Tyler Skluzacek, Anna Woodard, Ben Blaiszik, Ian Foster, and Kyle Chard. 2020. Funcx: A federated function serving fabric for science. In Proceedings of the 29th International Symposium on High-Performance Parallel and Distributed Computing. 65--76.
[24]
Sadjad Fouladi et almbox. 2019. From Laptop to Lambda: Outsourcing Everyday Jobs to Thousands of Transient Functional Containers. In USENIX ATC '19. 475--488.
[25]
Armando Fox et almbox. 2009. Above the clouds: A berkeley view of cloud computing. Dept. Electrical Eng. and Comput. Sciences, University of California, Berkeley, Rep. UCB/EECS, Vol. 28, 13 (2009).
[26]
Alexander Fuerst, Ahmed Ali-Eldin, Prashant Shenoy, and Prateek Sharma. 2020. Cloud-Scale VM-Deflation for Running Interactive Applications On Transient Servers. In Proceedings of ACM HPDC '20. 53--64.
[27]
Anshul Gandhi, Parijat Dube, Alexei Karve, Andrzej Kochut, and Li Zhang. 2014. Adaptive, model-driven autoscaling for cloud applications. In 11th International Conference on Autonomic Computing (ICAC '14). 57--64.
[28]
Mor Harchol-Balter. 2013. Performance modeling and design of computer systems: queueing theory in action. Cambridge University Press.
[29]
Mor Harchol-Balter, Bianca Schroeder, Nikhil Bansal, and Mukesh Agrawal. 2003. Size-based scheduling to improve web performance. ACM Transactions on Computer Systems (TOCS), Vol. 21, 2 (2003), 207--233.
[30]
Joseph M Hellerstein et almbox. [n.d.]. Serverless computing: One step forward, two steps back. In CIDR 2019.
[31]
Forrest N Iandola et almbox. 2016. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and 0.5 MB model size. arXiv preprint arXiv:1602.07360 (2016).
[32]
Sally F Issawi, Alaa Al Halees, and Mohammed Radi. 2015. An efficient adaptive load balancing algorithm for cloud computing under Bursty workloads. Engineering, Technology & Applied Science Research, Vol. 5, 3 (2015), 795--800.
[33]
Woochul Kang and Jaeyong Chung. 2019. DeepRT: predictable deep learning inference for cyber-physical systems. Real-Time Systems, Vol. 55, 1 (2019), 106--135.
[34]
Hamzeh Khazaei, Jelena Misic, and Vojislav B Misic. 2011. Performance analysis of cloud computing centers using M/G/M/mr queuing systems. IEEE Transactions on parallel and distributed systems, Vol. 23, 5 (2011), 936--943.
[35]
Ana Klimovic et almbox. 2018. Pocket: Elastic ephemeral storage for serverless analytics. In USENIX OSDI '18). 427--444.
[36]
Ricardo Koller and Dan Williams. 2017. Will serverless end the dominance of linux in the cloud?. In Proceedings of HotOS '17. 169--173.
[37]
Ningning Ma, Xiangyu Zhang, Hai-Tao Zheng, and Jian Sun. 2018. Shufflenet v2: Practical guidelines for efficient cnn architecture design. In ECCV '18. 116--131.
[38]
Hai Duc Nguyen et almbox. 2019. Real-time Serverless: Enabling Application Performance Guarantees. In WoSC '19. 1--6.
[39]
Xingzhi Niu et almbox. 2019. Leveraging Serverless Computing to Improve Performance for Sequence Comparison. In Proceedings of the ACM BCB '19. 683--687.
[40]
Mark Sandler et almbox. 2018. Mobilenetv2: Inverted residuals and linear bottlenecks. In Proceedings of the IEEE CVPR '18. 4510--4520.
[41]
Mahadev Satyanarayanan. 2017. The emergence of edge computing. Computer, Vol. 50, 1 (2017), 30--39.
[42]
Mohammad Shahrad et almbox. 2020. Serverless in the Wild: Characterizing and Optimizing the Serverless Workload at a Large Cloud Provider. In USENIX ATC.
[43]
Mohammad Shahrad, Jonathan Balkind, and David Wentzlaff. 2019. Architectural implications of function-as-a-service computing. In Proceedings of IEEE/ACM MICRO '52. 1063--1075.
[44]
Prateek Sharma et almbox. 2019. Resource deflation: A new approach for transient resource reclamation. In Proceedings of the Fourteenth EuroSys Conference. 1--17.
[45]
Weisong Shi et almbox. 2016. Edge computing: Vision and challenges. IEEE internet of things journal, Vol. 3, 5 (2016), 637--646.
[46]
Amoghvarsha Suresh and Anshul Gandhi. 2019. FnSched: An Efficient Scheduler for Serverless Functions. In Proceedings of WoSC '19. 19--24.
[47]
Jianzhe Tai, Juemin Zhang, Jun Li, Waleed Meleis, and Ningfang Mi. 2011. ArA: Adaptive resource allocation for cloud computing environments under bursty workloads. In 30th IEEE International Performance Computing and Communications Conference. 1--8.
[48]
Bo Tan, Haikun Liu, Jia Rao, Xiaofei Liao, Hai Jin, and Yu Zhang. 2020. Towards Lightweight Serverless Computing via Unikernel as a Function. In IWQoS '20. 1--10.
[49]
Bhuvan Urgaonkar, Giovanni Pacifici, Prashant Shenoy, Mike Spreitzer, and Asser Tantawi. 2005. An Analytical Model for Multi-Tier Internet Services and Its Applications. In ACM SIGMETRICS '05. 291--302.
[50]
Liang Wang et almbox. 2018a. Peeking behind the curtains of serverless platforms. In USENIX ATC. 133--146.
[51]
Robert J Wang et almbox. 2018b. Pelee: A real-time object detection system on mobile devices. In Advances in Neural Information Processing Systems. 1963--1972.
[52]
Adam Wierman, Lachlan LH Andrew, and Ao Tang. 2009. Power-aware speed scaling in processor sharing systems. In IEEE INFOCOM 2009. IEEE, 2007--2015.
[53]
Shanhe Yi et almbox. 2017. Lavea: Latency-aware video analytics on edge computing platform. In Proceedings of ACM/IEEE SEC '17. 1--13.
[54]
Chengliang Zhang, Minchen Yu, Wei Wang, and Feng Yan. 2019 c. Mark: Exploiting cloud services for cost-effective, slo-aware machine learning inference serving. In USENIX ATC '19. 1049--1062.
[55]
Miao Zhang et almbox. 2019 a. Video processing with serverless computing: a measurement study. In Proceedings of the NOSSDAV '19. 61--66.
[56]
Tian Zhang, Dong Xie, Feifei Li, and Ryan Stutsman. 2019 b. Narrowing the gap between serverless and its state with storage functions. In ACM SoCC '19. 1--12.

Cited By

View all
  • (2024)Latency-Sensitive Function Placement among Heterogeneous Nodes in Serverless ComputingSensors10.3390/s2413419524:13(4195)Online publication date: 27-Jun-2024
  • (2024)AI-Driven QoS-Aware Scheduling for Serverless Video Analytics at the EdgeInformation10.3390/info1508048015:8(480)Online publication date: 13-Aug-2024
  • (2024)Accountable Carbon Footprints and Energy Profiling For Serverless FunctionsProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698531(522-541)Online publication date: 20-Nov-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
HPDC '21: Proceedings of the 30th International Symposium on High-Performance Parallel and Distributed Computing
June 2021
275 pages
ISBN:9781450382175
DOI:10.1145/3431379
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 June 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. cloud computing
  2. edge computing
  3. function-as-a-service (faas)
  4. queueing theory
  5. serverless computing
  6. service-level agreement (sla)

Qualifiers

  • Research-article

Funding Sources

  • NSF

Conference

HPDC '21
Sponsor:

Acceptance Rates

Overall Acceptance Rate 166 of 966 submissions, 17%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)446
  • Downloads (Last 6 weeks)46
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Latency-Sensitive Function Placement among Heterogeneous Nodes in Serverless ComputingSensors10.3390/s2413419524:13(4195)Online publication date: 27-Jun-2024
  • (2024)AI-Driven QoS-Aware Scheduling for Serverless Video Analytics at the EdgeInformation10.3390/info1508048015:8(480)Online publication date: 13-Aug-2024
  • (2024)Accountable Carbon Footprints and Energy Profiling For Serverless FunctionsProceedings of the 2024 ACM Symposium on Cloud Computing10.1145/3698038.3698531(522-541)Online publication date: 20-Nov-2024
  • (2024)NEPTUNE: A Comprehensive Framework for Managing Serverless Functions at the EdgeACM Transactions on Autonomous and Adaptive Systems10.1145/363475019:1(1-32)Online publication date: 14-Feb-2024
  • (2024)Efficient Cross-platform Multiplexing of Hardware Performance Counters via Adaptive GroupingACM Transactions on Architecture and Code Optimization10.1145/362952521:1(1-26)Online publication date: 19-Jan-2024
  • (2024)Faashouse: Sustainable Serverless Edge Computing Through Energy-Aware Resource SchedulingIEEE Transactions on Services Computing10.1109/TSC.2024.335429617:4(1533-1547)Online publication date: Jul-2024
  • (2024)LEASE: Leveraging Energy-Awareness in Serverless Edge for Latency-Sensitive IoT Services2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events (PerCom Workshops)10.1109/PerComWorkshops59983.2024.10502788(302-307)Online publication date: 11-Mar-2024
  • (2024)INVAR: Inversion Aware Resource Provisioning and Workload Scheduling for Edge ComputingIEEE INFOCOM 2024 - IEEE Conference on Computer Communications10.1109/INFOCOM52122.2024.10621417(1511-1520)Online publication date: 20-May-2024
  • (2024)Online Container Caching with Late-Warm for IoT Data Processing2024 IEEE 40th International Conference on Data Engineering (ICDE)10.1109/ICDE60146.2024.00127(1547-1560)Online publication date: 13-May-2024
  • (2024)Efficient Serverless Function Scheduling in Edge ComputingICC 2024 - IEEE International Conference on Communications10.1109/ICC51166.2024.10622991(1029-1034)Online publication date: 9-Jun-2024
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media