Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1982185.1982217acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
research-article

Improving high-performance computations on clouds through resource underutilization

Published: 21 March 2011 Publication History

Abstract

We investigate the effects of shared resources for high-performance computing in a commercial cloud environment where multiple virtual machines share a single hardware node. Although good performance is occasionally obtained, contention degrades the expected performance and introduces significant variance. Using the DGEMM kernel and the HPL benchmark, we show that the underutilization of resources considerably improves expected performance by reducing contention for the CPU and cache space. For instance, for some cluster configurations, the solution is reached almost an order of magnitude earlier on average when the available resources are underutilized. The performance benefits for single node computations are even more impressive: Underutilization improves the expected execution time by two orders of magnitude. Finally, in contrast to unshared clusters, extending underutilized clusters by adding more nodes often improves the execution time due to an increased parallelism even with a slow interconnect. In the best case, by underutilizing the nodes performance was improved enough to entirely offset the cost of an extra node in the cluster.

References

[1]
Jack Dongarra, Robert van de Geijn, and David Walker. Scalability issues affecting the design of a dense linear algebra library. Journal of Parallel and Distributed Computing, 22(3): 523--537, September 1994.
[2]
Gene H. Golub and Charles F. Van Loan. Matrix computations (3rd ed.). Johns Hopkins University Press, 1996.
[3]
Kazushige Goto. GotoBLAS. Available via the WWW. Cited 1 Jan 2010. http://www.tacc.utexas.edu/resources/software/#blas.
[4]
Argonne National Laboratory. MPICH2: High-performance and widely portable MPI. Available via the WWW. Cited 1 Jan 2010. http://www.mcs.anl.gov/research/projects/mpich2/.
[5]
Jeff Napper and Paolo Bientinesi. Can cloud computing reach the Top500? In Unconventional High-Performance Computing (UCHPC), May 2009.
[6]
Paolo Bientinesi, Roman Iakymchuk, and Jeff Napper. HPC on Competitive Cloud Resources. In Handbook of Cloud Computing. Springer, 2010.
[7]
Antoine Petitet, R. Clint Whaley, Jack Dongarra, and Andrew Cleary. HPL - a portable implementation of the high-performance LINPACK benchmark for distributed-memory computers. Available via the WWW. Cited 1 Jan 2010. http://www.netlib.org/benchmark/hpl/.
[8]
Amazon Web Services. Amazon elastic compute cloud (EC2). Available via the WWW. Cited 1 Jan 2010. http://aws.amazon.com/ec2.
[9]
Jörg Strebel and Alexander Stage. An economic decision model for business software application deployment on hybrid cloud environments. In Matthias Schumann, Lutz M. Kolbe, and Michael H. Breitner, editors, Tagungsband Multikonferenz Wirtschaftsinformatik, 2010. forthcoming.
[10]
TOP500. Org. Top 500 supercomputer sites. Available via the WWW. Cited 1 Jan 2010. http://www.top500.org/.
[11]
Edward Walker. Benchmarking Amazon EC2 for High-Performance Scientific Computing. ;login:, 33(5): 18--23, October 2008.
[12]
Guohui Wang and Eugene Ng. The impact of virtualization on network performance of Amazon EC2 data center. In INFOCOM '10: Proceedings of the 2010 IEEE Conference on Computer Communications. IEEE Communication Society, 2010.

Cited By

View all
  • (2019)Impact of using multi-levels of parallelism on HPC applications performance hosted on Azure cloud computingInternational Journal of High Performance Computing and Networking10.5555/3337645.333764613:3(251-260)Online publication date: 1-Jan-2019
  • (2018)Exploiting Parallel R in the Cloud with SPRINTMethods of Information in Medicine10.3414/ME11-02-003952:01(80-90)Online publication date: 24-Jan-2018
  • (2017)Running HPC applications on many million cores Cloud2017 40th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)10.23919/MIPRO.2017.7973420(209-214)Online publication date: May-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SAC '11: Proceedings of the 2011 ACM Symposium on Applied Computing
March 2011
1868 pages
ISBN:9781450301138
DOI:10.1145/1982185
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 March 2011

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Amazon EC2
  2. cloud computing
  3. compute intensive applications
  4. high-performance computing

Qualifiers

  • Research-article

Funding Sources

Conference

SAC'11
Sponsor:
SAC'11: The 2011 ACM Symposium on Applied Computing
March 21 - 24, 2011
TaiChung, Taiwan

Acceptance Rates

Overall Acceptance Rate 1,650 of 6,669 submissions, 25%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 12 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Impact of using multi-levels of parallelism on HPC applications performance hosted on Azure cloud computingInternational Journal of High Performance Computing and Networking10.5555/3337645.333764613:3(251-260)Online publication date: 1-Jan-2019
  • (2018)Exploiting Parallel R in the Cloud with SPRINTMethods of Information in Medicine10.3414/ME11-02-003952:01(80-90)Online publication date: 24-Jan-2018
  • (2017)Running HPC applications on many million cores Cloud2017 40th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO)10.23919/MIPRO.2017.7973420(209-214)Online publication date: May-2017
  • (2016)Constructing Performance-Predictable Clusters with Performance-Varying Resources of CloudsIEEE Transactions on Computers10.1109/TC.2015.251064865:9(2709-2724)Online publication date: 1-Sep-2016
  • (2015)Scalable system for e-orders as a service in cloudIEEE EUROCON 2015 - International Conference on Computer as a Tool (EUROCON)10.1109/EUROCON.2015.7313733(1-6)Online publication date: Sep-2015
  • (2015)Modeling the Speedup for Scalable Web ServicesICT Innovations 201410.1007/978-3-319-09879-1_18(177-186)Online publication date: 2015
  • (2014)Application for modern energy efficient data center2014 22nd Telecommunications Forum Telfor (TELFOR)10.1109/TELFOR.2014.7034602(1114-1117)Online publication date: Nov-2014
  • (2014)The FEDERICA infrastructure and experienceComputer Networks: The International Journal of Computer and Telecommunications Networking10.1016/j.bjp.2013.12.02961:C(176-183)Online publication date: 14-Mar-2014
  • (2014)Resource Scaling Performance for Cache Intensive Algorithms in Windows AzureIntelligent Distributed Computing VII10.1007/978-3-319-01571-2_10(77-86)Online publication date: 2014
  • (2013)L3B: Low level load balancer in the cloudEurocon 201310.1109/EUROCON.2013.6624994(250-257)Online publication date: Jul-2013
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media