Lineage tracing for general data warehouse transformations
The VLDB Journal — The International Journal on Very Large Data Bases
XPath processing in a nutshell
ACM SIGMOD Record
Kepler: An Extensible System for Design and Execution of Scientific Workflows
SSDBM '04 Proceedings of the 16th International Conference on Scientific and Statistical Database Management
XML data exchange: consistency and query answering
Proceedings of the twenty-fourth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Data exchange: semantics and query answering
Theoretical Computer Science - Database theory
VisTrails: visualization meets data management
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
Update exchange with mappings and provenance
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Provenance and scientific workflows: challenges and opportunities
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Provenance for Computational Tasks: A Survey
Computing in Science and Engineering
On the expressiveness of implicit provenance in query and update languages
ACM Transactions on Database Systems (TODS)
Efficient provenance storage over nested data collections
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Proceedings of the twenty-eighth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Techniques for efficiently querying scientific workflow provenance graphs
Proceedings of the 13th International Conference on Extending Database Technology
Fine-grained and efficient lineage querying of collection-based workflow provenance
Proceedings of the 13th International Conference on Extending Database Technology
RDFProv: A relational RDF store for querying and managing scientific workflow provenance
Data & Knowledge Engineering
On Provenance of Queries on Semantic Web Data
IEEE Internet Computing
The Open Provenance Model core specification (v1.1)
Future Generation Computer Systems
Putting lipstick on pig: enabling database-style workflow provenance
Proceedings of the VLDB Endowment
Labeling workflow views with fine-grained dependencies
Proceedings of the VLDB Endowment
Towards integrating workflow and database provenance
IPAW'12 Proceedings of the 4th international conference on Provenance and Annotation of Data and Processes
IPAW'12 Proceedings of the 4th international conference on Provenance and Annotation of Data and Processes
Hi-index | 0.00 |
We present a new provenance model for generating fine-grained data and service dependencies within XML data processing workflows. Our approach follows the widely used black box transformation semantics [15] in which service components produce new outputs from their inputs (without transformation). The heart of the model are data dependency rules which are evaluated on XML documents assembling all data produced by some workflow execution (similar to nested collections [5]). Dependency rules are defined in XPath extended with variables and can directly be compiled into XQuery expressions for generating provenance information in RDF-PROV [8]. We also present an implementation of our model, using the WebLab platform [19], showing step-by-step how our model works in a typical media mining use-case.