Automatic generation of workflow provenance
While workflow is playing an increasingly important role in e-Science, current systems lack support for the collection of provenance data. We argue that workflow provenance data should be automatically generated by the enactment engine and managed over ...
Managing rapidly-evolving scientific workflows
We give an overview of VisTrails, a system that provides an infrastructure for systematically capturing detailed provenance and streamlining the data exploration process. A key feature that sets VisTrails apart from previous visualization and scientific ...
Virtual logbooks and collaboration in science and software development
A key feature of collaboration is having a log of what is being done and how – for private use/reuse and for sharing selected parts with collaborators in today's complex, large scale scientific/software environments. Even better if this log is automatic,...
Applying provenance in distributed organ transplant management
The use of ICT solutions applied to Healthcare in distributed scenarios should not only provide improvements in the distributed processes and services they are targeted to assist but also provide ways to trace all the meaningful events and decisions ...
Provenance implementation in a scientific simulation environment
Many of today's engineering applications for simulations are lacking mechanisms to trace the generation of results and the underlying processes. Especially computations conducted in distributed computing environments such as Grids are lacking suitable means ...
Towards low overhead provenance tracking in near real-time stream filtering
Data streams flowing from the physical environment are as unpredictable as the environment itself. Radars go down, long haul networks drop packets, and readings are corrupted on the wire. Yet the data driven scientific models and data mining algorithms ...
Enabling provenance on large scale e-science applications
Large-scale e-Science experiments present unprecedented data handling requirements with their multi-petabyte data storages. Complex software applications, such as the ATLAS High Energy Physics experiment at CERN, run throughout Grid computing sites ...
Harvesting RDF triples
Managing scientific data requires tools that can track complex provenance information about digital resources and workflows. RDF triples are a convenient abstraction for combining independently-generated factual statements, including statements about ...
Mapping physical formats to logical models to extract data and metadata: the defuddle parsing engine
Scientists, motivated by the desire for systems-level understanding of phenomena, increasingly need to share their results across multiple disciplines. Accomplishing this requires data to be annotated, contextualized, and readily searchable and ...
Annotation and provenance tracking in semantic web photo libraries
As the volume of digital images available on the Web continues to increase, there is a clear need for more advanced techniques for their effective retrieval and management. In this paper, we present a domain independent framework for both annotating and ...
Metadata catalogs with semantic representations
Metadata catalogs store descriptive information about logical data items. These catalogs can then be queried to retrieve the particular logical data item that matches the criteria. However, the query has to be formulated in terms of the metadata ...
Combining provenance with trust in social networks for semantic web content filtering
Social networks are a popular movement on the web. On the Semantic Web, it is simple to make trust annotations to social relationships. In this paper, we present a two level approach to integrating trust, provenance, and annotations in Semantic Web ...
Recording actor state in scientific workflows
The process which leads to a particular data item, or its provenance, may be documented in a number of ways. The recording of actor state assertions – essentially data that a client or service actor may assert about itself regarding an interaction, is ...
Provenance collection support in the Kepler scientific workflow system
In many data-driven applications, analysis needs to be performed on scientific information obtained from several sources and generated by computations on distributed resources. Systematic analysis of this scientific information unleashes a growing need ...
A model for user-oriented data provenance in pipelined scientific workflows
Integrated provenance support promises to be a chief advantage of scientific workflow systems over script-based alternatives. While it is often recognized that information gathered during scientific workflow execution can be used automatically to ...
Applying the virtual data provenance model
In many domains of science, engineering, and commerce, data analysis systems are employed to derive new data (and ultimately, one hopes, knowledge) from datasets describing experimental results or simulated phenomena. To support such analyses, we have ...
A provenance model for manually curated data
Many curated databases are constructed by scientists integrating various existing data sources “by hand”, that is, by manually entering or copying data from other sources. Capturing provenance in such an environment is a challenging problem, requiring a ...
Issues in automatic provenance collection
Automatic provenance collection describes systems that observe processes and data transformations, inferring, collecting, and maintaining provenance about them. Automatic collection is a powerful tool for analysis of objects and processes, providing a ...
Electronically querying for the provenance of entities
The provenance of entities, whether electronic data or physical artefacts, is crucial information in practically all domains, including science, business and art. The increased use of software in automating activities provides the opportunity to add ...
AstroDAS: sharing assertions across astronomy catalogues through distributed annotation
As diverse scientific data collections migrate online, researchers want the ability to share their assertions regarding the entities that span these disparate databases. We focus on a case study provided by the astronomical community's Virtual ...
Security issues in a SOA-Based provenance system
Recent work has begun exploring the characterization and utilization of provenance in systems based on the Service Oriented Architecture (such as Web Services and Grid based environments). One of the salient issues related to provenance use within any ...
Implementing a secure annotation service
Annotation systems enable “value-adding” to digital resources by the attachment of additional data in the form of comments, explanations, references, reviews and other types of external, subjective remarks. They facilitate group discourse and capture ...
Performance evaluation of the karma provenance framework for scientific workflows
Provenance about workflow executions and data derivations in scientific applications helps estimate data quality, track resources, and validate in silico experiments. The Karma provenance framework provides a means to collect workflow, process, and data ...
Exploring provenance in a distributed job execution system
We examine provenance in the context of a distributed job execution system. It is crucial to capture provenance information during the execution of a job in a distributed environment because often this information is lost once the job has finished. In ...
gLite job provenance
- František Dvořák,
- Daniel Kouřil,
- Aleš Křenek,
- Luděk Matyska,
- Miloš Mulač,
- Jan Pospíšil,
- Miroslav Ruda,
- Zdeněk Salvet,
- Jiří Sitera,
- Michal Voců
The Job Provenance (JP) service is designed to automate keeping track of computations on large scale Grids, thus giving users a tool to correctly archive information about their jobs and to re-submit any job in a reconstructed environment. JP provides a ...
An identity crisis in the life sciences
myGrid is an e-Science project assisting life scientists to build workflows that gather data from distributed, autonomous, replicated and heterogeneous resources. The provenance logs of workflow executions are recorded as RDF graphs. The log of one ...
CombeChem: a case study in provenance and annotation using the semantic web
The CombeChem e-Science project has demonstrated the advantages of using Semantic Web technology, in particular RDF and triplestores, to describe and link diverse and complex chemical information, covering the whole process of the generation of chemical ...
Principles of high quality documentation for provenance: a philosophical discussion
Computer technology enables the creation of detailed documentation about the processes that create or affect entities (data, objects, etc.). Such documentation of the past can be used to answer various kinds of questions regarding the processes that led ...