Supporting Fine-grained Data Lineage in a Database Visualization Environment
ICDE '97 Proceedings of the Thirteenth International Conference on Data Engineering
Why and Where: A Characterization of Data Provenance
ICDT '01 Proceedings of the 8th International Conference on Database Theory
Scientific Workflow Management by Database Management
SSDBM '98 Proceedings of the 10th International Conference on Scientific and Statistical Database Management
Chimera: AVirtual Data System for Representing, Querying, and Automating Data Derivation
SSDBM '02 Proceedings of the 14th International Conference on Scientific and Statistical Database Management
Using Object Deputy Model to Prepare Data for Data Warehousing
IEEE Transactions on Knowledge and Data Engineering
Integrating databases and workflow systems
ACM SIGMOD Record
Scientific workflow management and the Kepler system: Research Articles
Concurrency and Computation: Practice & Experience - Workflow in Grid Systems
Object Deputy Database Language
C5 '06 Proceedings of the Fourth International Conference on Creating, Connecting and Collaborating through Computing
GridDB: a data-centric overlay for scientific grids
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
An annotation management system for relational databases
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Metadata in the collaboratory for multi-scale chemical science
DCMI '03 Proceedings of the 2003 international conference on Dublin Core and metadata applications: supporting communities of discourse and practice---metadata research & applications
Hi-index | 0.00 |
There is a critical need to automatically manage large volumes of scientific data and applications in scientific workflows. Database technologies seem to be well suited to handle highly complex data managements. However, most of the workflow management systems (WFMSs) only utilize database technologies to a limited extent. In this paper, we present a DB-integrated scientific workflow framework which adopts the object deputy model to describe the execution of a series of scientific tasks. This framework allows WFMS management operations to be performed in a way analogous to traditional data management operations. Most important of all, data provenance method of this framework can provide much higher performance than other methods. Three kinds of schemas for data provenance are proposed and performance for each schema is analyzed in this paper.