A survey of data provenance in e-science
ACM SIGMOD Record
Provenance Provisioning in Mobile Agent-Based Distributed Job Workflow Execution
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007
Layering in provenance systems
USENIX'09 Proceedings of the 2009 conference on USENIX Annual technical conference
Designing a step-by-step user interface for finding provenance information over linked data
ICWE'11 Proceedings of the 11th international conference on Web engineering
Provenance explorer – customized provenance views using semantic inferencing
ISWC'06 Proceedings of the 5th international conference on The Semantic Web
Meta-line: lineage information for improved metadata quality
Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
Evaluation of a Hybrid Approach for Efficient Provenance Storage
ACM Transactions on Storage (TOS)
Hi-index | 0.00 |
As peer-to-peer dissemination of custom data productsevolves among Earth science research groups,investigators and data managers must consider how tocompose appropriate metadata for their researchcomputing activities. Because workflows may spanmultiple groups, it is critical that lineage (provenance)metadata also be assembled to document and preservethe origins and processing history of constituent dataproducts and transformations for future data consumers.To demonstrate methods for composing lineage metadatafor custom processing, we introduce our terminology forworkflow and employ a case study for the creation ofsatellite-derived ocean color data products. Our examplecontributes to a general metadata model for workflowthat incorporates lineage. We then discuss metadatarequirements for remote sensing-related data products.We propose two techniques for composing lineagemetadata, both based on accessory XML metadatadocuments that are paired with the data products andversioned data transformations they describe. The firsttechnique, implemented as a prototype, features adedicated lineage server that introduces the indirectionand flexibility necessary for Web-based lineagenavigation. The second, more promising techniqueinvolves defining a simple Resource DescriptionFramework (RDF) vocabulary for lineage metadata, andusing extant RDF/XML tools for query and navigation.These methods provide guidelines for composing lineagemetadata that are applicable to other domains.