Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2806777.2806851acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

SpotOn: a batch computing service for the spot market

Published: 27 August 2015 Publication History

Abstract

Cloud spot markets enable users to bid for compute resources, such that the cloud platform may revoke them if the market price rises too high. Due to their increased risk, revocable resources in the spot market are often significantly cheaper (by as much as 10×) than the equivalent non-revocable on-demand resources. One way to mitigate spot market risk is to use various fault-tolerance mechanisms, such as checkpointing or replication, to limit the work lost on revocation. However, the additional performance overhead and cost for a particular fault-tolerance mechanism is a complex function of both an application's resource usage and the magnitude and volatility of spot market prices.
We present the design of a batch computing service for the spot market, called SpotOn, that automatically selects a spot market and fault-tolerance mechanism to mitigate the impact of spot revocations without requiring application modification. SpotOn's goal is to execute jobs with the performance of on-demand resources, but at a cost near that of the spot market. We implement and evaluate SpotOn in simulation and using a prototype on Amazon's EC2 that packages jobs in Linux Containers. Our simulation results using a job trace from a Google cluster indicate that SpotOn lowers costs by 91.9% compared to using on-demand resources with little impact on performance.

References

[1]
Scientific Computing Using Spot Instances. http://aws.amazon.com/ec2/spot-and-science/, June 2013.
[2]
ClusterK. https://clusterk.com/, July 10th 2015.
[3]
Google preemptible instances. https://cloud.google.com/compute/docs/instances/preemptible, July 10th 2015.
[4]
J. Barr. New - EC2 Spot Instance Termination Notices. https://aws.amazon.com/blogs/aws/new-ec2-spot-instance-termination-notices/, January 6th 2015.
[5]
C. Clark, K. Fraser, S. Hand, J. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Waeldr. Live Migration of Virtual Machines. In NSDI, May 2005.
[6]
B. Cully, G. Lefebvre, D. Meyer, M. Feeley, N. Hutchinson, and A. Warfield. Remus: High Availability via Asynchronous Virtual Machine Replication. In NSDI, April 2008.
[7]
S. Di, D. Kondo, and F. Cappello. Characterizing and Modeling Cloud Applications/Jobs on a Google Data Center. Supercomputing, 69(1), July 2014.
[8]
M. R. Hines, U. Deshpande, and K. Gopalan. Post-copy Live Migration of Virtual Machines. SIGOPS Operating Systems Review, 43(3), July 2009.
[9]
S. Khatua and N. Mukherjee. Application-centric Resource Provisioning for Amazon EC2 Spot Instances. In EuroPar, August 2013.
[10]
M. Mao and M. Humphrey. A Performance Study on VM Startup Time in the Cloud. In CLOUD, June 2012.
[11]
M. Mattess, C. Vecchiola, and R. Buyya. Managing Peak Loads by Leasing Cloud Infrastructure Services from a Spot Market. In HPCC, September 2010.
[12]
M. Mazzucco and M. Dumas. Achieving Performance and Availability Guarantees with Spot Instances. In HPCC, September 2011.
[13]
I. Menache, O. Shamir, and N. Jain. On-demand, Spot, or Both: Dynamic Resource Allocation for Executing Batch Jobs in the Cloud. In ICAC, 2014.
[14]
A. Mu'alem and D. Feitelson. Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling. TPDS, 12(6), June 2001.
[15]
C. Reiss, J. Wilkes, and J. L. Hellerstein. Google Cluster-usage Traces: Format + Schema. Technical report, Google Inc., November 2011.
[16]
P. Sharma, S. Lee, T. Guo, D. Irwin, and P. Shenoy. SpotCheck: Designing a Derivative IaaS Cloud on the Spot Market. In EuroSys, April 2015.
[17]
R. Singh, D. Irwin, P. Shenoy, and K. Ramakrishnan. Yank: Enabling Green Data Centers to Pull the Plug. In NSDI, April 2013.
[18]
Y. Song, M. Zafer, and K. Lee. Optimal Bidding in Spot Instance Market. In Infocom, March 2012.
[19]
S. Tang, J. Yuan, and X. Li. Towards Optimal Bidding Strategy for Amazon EC2 Cloud Spot Instance. In CLOUD, June 2012.
[20]
D. Tsafrir, Y. Etsion, and D. Feitelson. Modeling User Runtime Estimates. In JSSP, 2005.
[21]
W. Voorsluys and R. Buyya. Reliable Provisioning of Spot Instances for Compute-Intensive Applications. In AINA, 2012.
[22]
S. Yi, D. Kondo, and A. Andrzejak. Reducing Costs of Spot Instances via Checkpointing in the Amazon Elastic Compute Cloud. In CLOUD, July 2010.
[23]
M. Zafer, Y. Song, and K. Lee. Optimal Bids for Spot VMs in a Cloud for Deadline Constrained Jobs. In CLOUD, 2012.
[24]
S. Zaman and D. Grosu. Efficient Bidding for Virtual Machine Instances in Clouds. In CLOUD, July 2011.
[25]
Q. Zhang, E. Gürses, R. Boutaba, and J. Xiao. Dynamic Resource Allocation for Spot Markets in Clouds. In HotICE, March 2011.

Cited By

View all
  • (2024)Improving Amazon EC2 Spot Instances Price Prediction using Machine Learning AlgorithmInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology10.32628/CSEIT2410210210:2(713-720)Online publication date: 22-Apr-2024
  • (2024)Making Cloud Spot Instance Interruption Events VisibleProceedings of the ACM Web Conference 202410.1145/3589334.3645548(2998-3009)Online publication date: 13-May-2024
  • (2024)Workload-Aware Live Migratable Cloud Instance Detector2024 IEEE 24th International Symposium on Cluster, Cloud and Internet Computing (CCGrid)10.1109/CCGrid59990.2024.00029(178-188)Online publication date: 6-May-2024
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SoCC '15: Proceedings of the Sixth ACM Symposium on Cloud Computing
August 2015
446 pages
ISBN:9781450336512
DOI:10.1145/2806777
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 August 2015

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. batch job
  2. fault-tolerance
  3. spot market

Qualifiers

  • Research-article

Funding Sources

Conference

SoCC '15
Sponsor:
SoCC '15: ACM Symposium on Cloud Computing
August 27 - 29, 2015
Hawaii, Kohala Coast

Acceptance Rates

SoCC '15 Paper Acceptance Rate 34 of 157 submissions, 22%;
Overall Acceptance Rate 169 of 722 submissions, 23%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)45
  • Downloads (Last 6 weeks)1
Reflects downloads up to 04 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Improving Amazon EC2 Spot Instances Price Prediction using Machine Learning AlgorithmInternational Journal of Scientific Research in Computer Science, Engineering and Information Technology10.32628/CSEIT2410210210:2(713-720)Online publication date: 22-Apr-2024
  • (2024)Making Cloud Spot Instance Interruption Events VisibleProceedings of the ACM Web Conference 202410.1145/3589334.3645548(2998-3009)Online publication date: 13-May-2024
  • (2024)Workload-Aware Live Migratable Cloud Instance Detector2024 IEEE 24th International Symposium on Cluster, Cloud and Internet Computing (CCGrid)10.1109/CCGrid59990.2024.00029(178-188)Online publication date: 6-May-2024
  • (2024)An Online Algorithm Based on Replication for Using Spot Instances in IaaS CloudsJournal of Computer Science and Technology10.1007/s11390-023-1535-439:1(103-115)Online publication date: 1-Feb-2024
  • (2024)An Online Algorithm for Cost Minimization of Amazon EC2 Burstable ResourcesDistributed Computing and Intelligent Technology10.1007/978-3-031-50583-6_8(117-132)Online publication date: 4-Jan-2024
  • (2023)FSP: Towards Flexible Synchronous Parallel Frameworks for Distributed Machine LearningIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2022.322873334:2(687-703)Online publication date: 1-Feb-2023
  • (2023)Scheduling Bag-of-Tasks in Clouds Using Spot and Burstable Virtual MachinesIEEE Transactions on Cloud Computing10.1109/TCC.2021.312542611:1(984-996)Online publication date: 1-Jan-2023
  • (2022)Cost-Effective Spot Instances Provisioning Using Features of Cloud MarketsInternational Journal of Cloud Applications and Computing10.4018/IJCAC.30827612:1(1-27)Online publication date: 30-Nov-2022
  • (2022)CoSpotProceedings of the 13th Symposium on Cloud Computing10.1145/3542929.3563499(540-556)Online publication date: 7-Nov-2022
  • (2022)SciSpot: Scientific Computing On Temporally Constrained Cloud Preemptible VMsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2022.315727233:12(3575-3588)Online publication date: 1-Dec-2022
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media