Principles of programming with complex objects and collection types
ICDT '92 Selected papers of the fourth international conference on Database theory
Why and Where: A Characterization of Data Provenance
ICDT '01 Proceedings of the 8th International Conference on Database Theory
A survey of data provenance in e-science
ACM SIGMOD Record
Towards a Quality Model for Effective Data Selection in Collaboratories
ICDEW '06 Proceedings of the 22nd International Conference on Data Engineering Workshops
VisTrails: visualization meets data management
Proceedings of the 2006 ACM SIGMOD international conference on Management of data
A Framework for Collecting Provenance in Data-Centric Scientific Workflows
ICWS '06 Proceedings of the IEEE International Conference on Web Services
Provenance-based validation of e-science experiments
Web Semantics: Science, Services and Agents on the World Wide Web
Taverna Workflows: Syntax and Semantics
E-SCIENCE '07 Proceedings of the Third IEEE International Conference on e-Science and Grid Computing
Databases with uncertainty and lineage
The VLDB Journal — The International Journal on Very Large Data Bases
Automatic annotation of Web services based on workflow definitions
ACM Transactions on the Web (TWEB)
Querying and Managing Provenance through User Views in Scientific Workflows
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
A formal model of dataflow repositories
DILS'07 Proceedings of the 4th international conference on Data integration in the life sciences
Provenance collection support in the kepler scientific workflow system
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Electronically querying for the provenance of entities
IPAW'06 Proceedings of the 2006 international conference on Provenance and Annotation of Data
Building Scientific Workflow with Taverna and BPEL: A Comparative Study in caGrid
Service-Oriented Computing --- ICSOC 2008 Workshops
Research issues in data provenance
Proceedings of the 48th Annual Southeast Regional Conference
Future Generation Computer Systems
Meta-line: lineage information for improved metadata quality
Proceedings of the 12th ACM/IEEE-CS joint conference on Digital Libraries
TaPP'12 Proceedings of the 4th USENIX conference on Theory and Practice of Provenance
Static compiler analysis for workflow provenance
WORKS '13 Proceedings of the 8th Workshop on Workflows in Support of Large-Scale Science
Hi-index | 0.00 |
The provenance, or lineage , of a workflow data product can be reconstructed by keeping a complete trace of workflow execution. This lineage information, however, is likely to be both imprecise, because of the black-box nature of the services that compose the workflow, and noisy, because of the many trivial data transformations that obscure the intended purpose of the workflow. In this paper we argue that these shortcomings can be alleviated by introducing a small set of optional lightweight annotations to the workflow, in a principled way. We begin by presenting a baseline, annotation-free lineage model for the Taverna workflow system, and then show how the proposed annotations improve the results of fundamental lineage queries.