Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Metrics for heterogeneous scientific workflows: A case study of an earthquake science application

Published: 01 August 2011 Publication History

Abstract

Scientific workflows are a common computational model for performing scientific simulations. They may include many jobs, many scientific codes, and many file dependencies. Since scientific workflow applications may include both high-performance computing (HPC) and high-throughput computing (HTC) jobs, meaningful performance metrics are difficult to define, as neither traditional HPC metrics nor HTC metrics fully capture the extent of the application. We describe and propose the use of alternative metrics to accurately capture the scale of scientific workflows and quantify their efficiency. In this paper, we present several specific practical scientific workflow performance metrics and discuss these metrics in the context of a large-scale scientific workflow application, the Southern California Earthquake Center CyberShake 1.0 Map calculation. Our metrics reflect both computational performance, such as floating-point operations and file access, and workflow performance, such as job and task scheduling and execution. We break down performance into three levels of granularity: the task, the workflow, and the application levels, presenting a complete view of application performance. We show how our proposed metrics can be used to compare multiple invocations of the same application, as well as executions of heterogeneous applications, quantifying the amount of work performed and the efficiency of the work. Finally, we analyze CyberShake using our proposed metrics to determine potential application optimizations.

References

[1]
Callaghan S., et al. (2008) Reducing time-to-solution using distributed high-throughput mega-workflows - experiences from SCEC CyberShake. In: Proceedings of the Fourth IEEE International Conference on e-Science (e-Science 2008), Indianapolis, Indiana, USA.
[2]
Callaghan S., et al. Scaling up workflow-based applications. J Comput Syst Sci 76: 428-446.
[3]
Deelman E., et al. (2005) Pegasus: a framework for mapping complex scientific workflows onto distributed systems. Sci Program J 13: 219-237.
[4]
Dongarra J., London K., Moore S., Mucci P. and Terpstra D. (2001) Using PAPI for hardware performance monitoring on linux systems. In: Proceedings of the Conference on Linux Clusters: The HPC Revolution, Linux Clusters Institute, Urbana, Illinois, 25-27 June.
[5]
Graves R., et al. (2008) Physics based probabilistic seismic hazard calculations for Southern California. In: Proceedings of the 14th World Conference on Earthquake Engineering, Beijing, China, 12-17 October.
[6]
Nerieri F., et al. (2006) Overhead analysis of grid workflow applications . In: Proceedings of the 7th IEEE/ACM International Conference on Grid Computing, pp. 17-24.
[7]
Ostermann S., et al. (2008) On the characteristics of grid workflows . In: Proceedings of the CoreGRID Workshop on Integrated Research in Grid Computing (CGIW'08), pp. 431-442.
[8]
Prodan R. and Fahringer T. (2008) Overhead analysis of scientific workflows in grid environments . IEEE Trans Parallel Distrib Syst 19: 378-393.
[9]
Raicu I., Foster T. and Zhao Y. (2008) Many-task computing for grids and supercomputers . Workshop on Many-Task Computing for Grids and Supercomputers 2008, pp. 1-11.
[10]
Raman R., Livny M. and Soloman M. (1998) Matchmaking: distributed resource management for high throughput computing. In: Proceedings of the Seventh IEEE International Symposium on High Performance Distributed Computing (HPDC-7 `98), pp. 140-146.
[11]
Sfiligoi I glideinWMS-a generic pilot-based workload management system. J Phys Conf Ser.
[12]
Stratan C., Iosup A. and Epema DHJ (2008) A performance study of grid workflow engines. In: Proceedings of the 9th IEEE/ ACM International Conference on Grid Computing, pp. 25-32.
[13]
Taylor I, Deelman E, Gannon D and Shields M (eds) (2006) Workflows in e-Science. New York: Springer.
[14]
Tierney B. and Gunter D. (2003) NetLogger: a toolkit for distributed system performance tuning and debugging. In: Proceedings of the 8th IFIP/IEEE International Symposium on Integrated Network Management.
[15]
Zhao L., Chen P. and Jordan TH (2006) Strain Green's tensors, reciprocity, and their applications to seismic source and structure studies. Bull Seismol Soc Am 96: 1753-1763.

Cited By

View all
  • (2019)Data-Oriented Scheduling with Dynamic-Clustering Fault-Tolerant Technique for Scientific Workflows in CloudsProgramming and Computing Software10.1134/S036176881908009745:8(506-516)Online publication date: 1-Dec-2019
  • (2017)rvGAHPProceedings of the 12th Workshop on Workflows in Support of Large-Scale Science10.1145/3150994.3151003(1-8)Online publication date: 12-Nov-2017
  • (2016)Storage-aware Algorithms for Scheduling of Workflow Ensembles in CloudsJournal of Grid Computing10.1007/s10723-015-9355-614:2(359-378)Online publication date: 1-Jun-2016
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image International Journal of High Performance Computing Applications
International Journal of High Performance Computing Applications  Volume 25, Issue 3
August 2011
92 pages

Publisher

Sage Publications, Inc.

United States

Publication History

Published: 01 August 2011

Author Tags

  1. large-scale simulations
  2. performance metrics
  3. scientific workflows

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2019)Data-Oriented Scheduling with Dynamic-Clustering Fault-Tolerant Technique for Scientific Workflows in CloudsProgramming and Computing Software10.1134/S036176881908009745:8(506-516)Online publication date: 1-Dec-2019
  • (2017)rvGAHPProceedings of the 12th Workshop on Workflows in Support of Large-Scale Science10.1145/3150994.3151003(1-8)Online publication date: 12-Nov-2017
  • (2016)Storage-aware Algorithms for Scheduling of Workflow Ensembles in CloudsJournal of Grid Computing10.1007/s10723-015-9355-614:2(359-378)Online publication date: 1-Jun-2016
  • (2015)Algorithms for cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS cloudsFuture Generation Computer Systems10.1016/j.future.2015.01.00448:C(1-18)Online publication date: 1-Jul-2015
  • (2014)Collecting cloud provenance metadata with MatriohskaProceedings of the 29th Annual ACM Symposium on Applied Computing10.1145/2554850.2555066(351-356)Online publication date: 24-Mar-2014
  • (2013)Federating queries in SPARQL 1.1Web Semantics: Science, Services and Agents on the World Wide Web10.1016/j.websem.2012.10.00118:1(1-17)Online publication date: 1-Jan-2013
  • (2013)A Case Study into Using Common Real-Time Workflow Monitoring Infrastructure for Scientific WorkflowsJournal of Grid Computing10.1007/s10723-013-9265-411:3(381-406)Online publication date: 1-Sep-2013
  • (2012)Cost- and deadline-constrained provisioning for scientific workflow ensembles in IaaS cloudsProceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis10.5555/2388996.2389026(1-11)Online publication date: 10-Nov-2012
  • (2012)Evaluating parameter sweep workflows in high performance computingProceedings of the 1st ACM SIGMOD Workshop on Scalable Workflow Execution Engines and Technologies10.1145/2443416.2443418(1-10)Online publication date: 20-May-2012
  • (2012)Enabling large-scale scientific workflows on petascale resources using MPI master/workerProceedings of the 1st Conference of the Extreme Science and Engineering Discovery Environment: Bridging from the eXtreme to the campus and beyond10.1145/2335755.2335846(1-8)Online publication date: 16-Jul-2012

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media