Designing and mining multi-terabyte astronomy archives: the Sloan Digital Sky Survey
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
A Framework for Collecting Provenance in Data-Centric Scientific Workflows
ICWS '06 Proceedings of the IEEE International Conference on Web Services
Provenance-aware storage systems
ATEC '06 Proceedings of the annual conference on USENIX '06 Annual Technical Conference
Special Issue: The First Provenance Challenge
Concurrency and Computation: Practice & Experience - The First Provenance Challenge
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Issues in automatic provenance collection
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Segment-based recovery: write-ahead logging revisited
Proceedings of the VLDB Endowment
Provenance as first class cloud data
ACM SIGOPS Operating Systems Review
FAST'10 Proceedings of the 8th USENIX conference on File and storage technologies
Trusted computing and provenance: better together
TAPP'10 Proceedings of the 2nd conference on Theory and practice of provenance
Trustworthy information: concepts and mechanisms
WAIM'10 Proceedings of the 11th international conference on Web-age information management
Provenance for system troubleshooting
LISA'11 Proceedings of the 25th international conference on Large Installation System Administration
A Provenance-based Adaptive Scheduling Heuristic for Parallel Scientific Workflows in Clouds
Journal of Grid Computing
SPADE: support for provenance auditing in distributed environments
Proceedings of the 13th International Middleware Conference
Towards design support for provenance awareness: a classification of provenance questions
Proceedings of the Joint EDBT/ICDT 2013 Workshops
Role of acquisition intervals in private and public cloud storage costs
Decision Support Systems
Hi-index | 0.00 |
The advent of cloud computing provides a cheap and convenient mechanism for scientists to share data. The utility of such data is obviously enhanced when the provenance of the data is also available. The cloud, while convenient for storing data, is not designed for storing and querying provenance. In this paper, we present desirable properties for distributed provenance storage systems and present design alternatives for storing data and provenance on Amazon's popular Web Services platform (AWS). We evaluate the properties satisfied by each approach and analyze the cost of storing and querying provenance in each approach.