Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2457317.2457379acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article

Provenance traces from Chiron parallel workflow engine

Published: 18 March 2013 Publication History

Abstract

Scientific workflows are commonly used to model and execute large-scale scientific experiments. They represent key resources for scientists and are managed by Scientific Workflow Management Systems (SWfMS). The different languages used by SWfMS may impact in the way the workflow engine executes the workflow, sometimes limiting optimization opportunities. To tackle this issue, we recently proposed a scientific workflow algebra [1]. This algebra is inspired by database relational algebra and it enables automatic optimization of scientific workflows to be executed in parallel in high performance computing (HPC) environments. This way, the experiments presented in this paper were executed in Chiron, a parallel scientific workflow engine implemented to support the scientific workflow algebra. Before executing the workflow, Chiron stores the prospective provenance [2] of the workflow on its provenance database. Each workflow is composed by several activities, and each activity consumes relations. Similarly to relational databases, a relation contains a set of attributes and it is composed by a set of tuples. Each tuple in a relation contains a series of values, each one associated to a specific attribute. The tuples of a relation are distributed to be consumed in parallel over the computing resources according to the workflow activity. During and after the execution, the retrospective provenance [2] is also stored.

References

[1]
E. Ogasawara, J. Dias, D. Oliveira, F. Porto, P. Valduriez, and M. Mattoso, "An Algebraic Approach for Data-Centric Scientific Workflows," Proc. of VLDB Endowment, vol. 4, no. 12, pp. 1328--1339, 2011.
[2]
J. Freire, D. Koop, E. Santos, and C. T. Silva, "Provenance for Computational Tasks: A Survey," Computing in Science and Engineering, v.10, no. 3, pp. 11--21, 2008.
[3]
J. Dias, E. Ogasawara, D. Oliveira, F. Porto, A. Coutinho, and M. Mattoso, "Supporting Dynamic Parameter Sweep in Adaptive and User-Steered Workflow," in 6th Workshop on Workflows in Support of Large-Scale Science, Seattle, WA, USA, 2011, pp. 31--36.
[4]
F. Costa, D. Oliveira, K. Ocana, E. Ogasawara, and M. Mattoso, "Enabling Re-Executions of Parallel Scientific Workflows Using Runtime Provenance Data. In: 4th International Provenance and Annotation Workshop," 2012.
[5]
J. Goncalves, D. Oliveira, K. A. C. S. Ocaña, E. Ogasawara, and M. Mattoso, "Using Domain-Specific Data to Enhance Scientific Workflow Steering Queries," in Proc. IPAW 2012, Santa Barbara, CA, 2012.

Cited By

View all
  • (2016)Extracting Semantics from Legacy Scientific Workflows2016 IEEE Tenth International Conference on Semantic Computing (ICSC)10.1109/ICSC.2016.102(9-16)Online publication date: Feb-2016
  • (2016)Re-provisioning of Cloud-Based Execution Infrastructure Using the Cloud-Aware Provenance to Facilitate Scientific Workflow Execution ReproducibilityCloud Computing and Services Science10.1007/978-3-319-29582-4_5(74-94)Online publication date: 3-Feb-2016

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
EDBT '13: Proceedings of the Joint EDBT/ICDT 2013 Workshops
March 2013
423 pages
ISBN:9781450315999
DOI:10.1145/2457317

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 March 2013

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

EDBT/ICDT '13

Acceptance Rates

EDBT '13 Paper Acceptance Rate 7 of 10 submissions, 70%;
Overall Acceptance Rate 7 of 10 submissions, 70%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 10 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Extracting Semantics from Legacy Scientific Workflows2016 IEEE Tenth International Conference on Semantic Computing (ICSC)10.1109/ICSC.2016.102(9-16)Online publication date: Feb-2016
  • (2016)Re-provisioning of Cloud-Based Execution Infrastructure Using the Cloud-Aware Provenance to Facilitate Scientific Workflow Execution ReproducibilityCloud Computing and Services Science10.1007/978-3-319-29582-4_5(74-94)Online publication date: 3-Feb-2016

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media