SPADE: support for provenance auditing in distributed environments
Proceedings of the 13th International Middleware Conference
Declaratively processing provenance metadata
TaPP'13 Proceedings of the 5th USENIX conference on Theory and Practice of Provenance
Reproducibility has been a cornerstone of the scientific method for hundreds of years. The range of sources from which data now originates, the diversity of the individual manipulations performed, and the complexity of the orchestrations of these operations all limit the reproducibility that a scientist can ensure solely by manually recording their actions. We use an architecture where aggregation, fusion, and composition policies define how provenance records can be automatically merged to facilitate the analysis and reproducibility of experiments. We show that the overhead of collecting and storing provenance metadata can vary dramatically depending on the policy used to integrate it.