Provenance for the cloud

Authors:
Kiran-Kumar Muniswamy-Reddy;Peter Macko;Margo Seltzer
Affiliations:
Harvard School of Engineering and Applied Sciences;Harvard School of Engineering and Applied Sciences;Harvard School of Engineering and Applied Sciences
Venue:
FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
Year:
2010

Citing 17
Cited 17

Software Configuration Management System Using Vesta (Monographs in Computer Science)

Software Configuration Management System Using Vesta (Monographs in Computer Science)
A Framework for Collecting Provenance in Data-Centric Scientific Workflows

ICWS '06 Proceedings of the IEEE International Conference on Web Services
Provenance-aware storage systems

ATEC '06 Proceedings of the annual conference on USENIX '06 Annual Technical Conference
User-mode Linux

ALS '01 Proceedings of the 5th annual Linux Showcase & Conference - Volume 5
Pors: proofs of retrievability for large files

Proceedings of the 14th ACM conference on Computer and communications security
Provable data possession at untrusted stores

Proceedings of the 14th ACM conference on Computer and communications security
Automatic capture and reconstruction of computational provenance

Concurrency and Computation: Practice & Experience - The First Provenance Challenge
Using provenance to aid in personal file search

ATC'07 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference
Building a database on S3

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Securing provenance

HOTSEC'08 Proceedings of the 3rd conference on Hot topics in security
The case of the fake Picasso: preventing history forgery with secure provenance

FAST '09 Proccedings of the 7th conference on File and storage technologies
Causality-based versioning

FAST '09 Proccedings of the 7th conference on File and storage technologies
The case for browser provenance

TAPP'09 First workshop on on Theory and practice of provenance
Making a cloud provenance-aware

TAPP'09 First workshop on on Theory and practice of provenance
Pregel: a system for large-scale graph processing - "ABSTRACT"

Proceedings of the 28th ACM symposium on Principles of distributed computing
Layering in provenance systems

USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
An experimentation workbench for replayable networking research

NSDI'07 Proceedings of the 4th USENIX conference on Networked systems design & implementation

Trail of bytes: efficient support for forensic analysis

Proceedings of the 17th ACM conference on Computer and communications security
DepSky: dependable and secure storage in a cloud-of-clouds

Proceedings of the sixth conference on Computer systems
An internationally distributed cloud for science: the cloud-enabled space weather platform

Proceedings of the 2nd International Workshop on Software Engineering for Cloud Computing
Towards SIRF: self-contained information retention format

Proceedings of the 4th Annual International Conference on Systems and Storage
Intrusion recovery for database-backed web applications

SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Securing data provenance in the cloud

iNetSec'11 Proceedings of the 2011 IFIP WG 11.4 international conference on Open Problems in Network Security
Provenance for system troubleshooting

LISA'11 Proceedings of the 25th international conference on Large Installation System Administration
Provenance as a security control

TaPP'12 Proceedings of the 4th USENIX conference on Theory and Practice of Provenance
Topological ordering algorithm for LDAG

Information Processing Letters
Towards secure provenance-based access control in cloud environments

Proceedings of the third ACM conference on Data and application security and privacy
SPADE: support for provenance auditing in distributed environments

Proceedings of the 13th International Middleware Conference
Towards design support for provenance awareness: a classification of provenance questions

Proceedings of the Joint EDBT/ICDT 2013 Workshops
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles

ACM SIGOPS 24th Symposium on Operating Systems Principles
Tango: distributed data structures over a shared log

Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
Exploring storage class memory with key value stores

Proceedings of the 1st Workshop on Interactions of NVM/FLASH with Operating Systems and Workloads
DepSky: Dependable and Secure Storage in a Cloud-of-Clouds

ACM Transactions on Storage (TOS)
Role of acquisition intervals in private and public cloud storage costs

Decision Support Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The cloud is poised to become the next computing environment for both data storage and computation due to its pay-as-you-go and provision-as-you-go models. Cloud storage is already being used to back up desktop user data, host shared scientific data, store web application data, and to serve web pages. Today's cloud stores, however, are missing an important ingredient: provenance. Provenance is metadata that describes the history of an object. We make the case that provenance is crucial for data stored on the cloud and identify the properties of provenance that enable its utility. We then examine current cloud offerings and design and implement three protocols for maintaining data/provenance in current cloud stores. The protocols represent different points in the design space and satisfy different subsets of the provenance properties. Our evaluation indicates that the overheads of all three protocols are comparable to each other and reasonable in absolute terms. Thus, one can select a protocol based upon the properties it provides without sacrificing performance. While it is feasible to provide provenance as a layer on top of today's cloud offerings, we conclude by presenting the case for incorporating provenance as a core cloud feature, discussing the issues in doing so.