Scientific experiments are becoming increasingly large and complex, with a commensurate increase in the amount and complexity of data generated. Data, both intermediate and final results, are derived by chaining and nesting together multiple database searches and analytical tools. In many cases, the means by which the data are produced is not known, making the data difficult to interpret and the experiment impossible to reproduce. Provenance in scientific workflows is thus of paramount importance. In this paper, we provide a formal model of provenance for scientific workflows which is general (i.e., can be used with existing workflow systems, such as Kepler, myGrid and Chimera) and sufficiently expressive to answer the provenance queries we encountered in a number of case studies. Interestingly, our model not only takes into account the chained and nested structure of scientific workflows, but also allows users to ask for provenance at different levels of abstraction (user views).