A primer on provenance

Published: 01 May 2014

Abstract

Better understanding data requires tracking its history and context.

Published In

Communications of the ACM, Volume 57, Issue 5
May 2014
110 pages
ISSN: 0001-0782
EISSN: 1557-7317
DOI: 10.1145/2594413
Editor: Moshe Y. Vardi
Publisher

Association for Computing Machinery

New York, NY, United States
