This paper explores the option of deriving provenance from existing log files, an approach that reduces the instrumentation task substantially but raises ...
Provenance from log files: A BigData problem | Request PDF
www.researchgate.net › publication › 26...
Data provenance is used for describing data evolution, which records the whole data process, including input, output and transformation in scientific workflow.
People also ask
What are the challenges of data provenance?
What are log files in relation to data security?
Mar 22, 2013 · Big data and increased data sharing, together impose a challenge to data provenance to become easier to deploy, easier to understand, and easier ...
Aug 1, 2023 · This paper employs a Hadoop-based big data system as the research object, and proposes a parallel log analysis method based on auxiliary data structures and ...
Missing: problem. | Show results with:problem.
Our datasets consist of 12 provenance graphs from five publicly available log files (i.e.,. T1 to T5) collected by the trace group in transparent comput- ing ...
Sep 16, 2024 · The data provenance problem arises when there is a lack of clear, accurate, and accessible records of this history, which is essential for data ...
Missing: files: | Show results with:files:
The Big Data Provenance Black Box as Reliable Evidence
publications.aaahq.org › jeta › article › S...
Dec 1, 2016 · When applied to data, provenance may be metadata or log files/audit trails pertaining to the lineage of a data event, capturing and recording ...
Provenance from log files: A BigData problem. Conference Paper. Mar 2013. Devarshi Ghoshal · Beth Plale. As new data products of ...
Feb 20, 2019 · Layer 4 provenance is what most Big Data systems record, the application provenance. This is provenance specific to a particular application ...
Data provenance is important for tracking down errors within data and attributing them to sources. Additionally, data provenance can be useful in reporting and ...