Why and Where: A Characterization of Data Provenance
ICDT '01 Proceedings of the 8th International Conference on Database Theory
Lineage tracing for general data warehouse transformations
The VLDB Journal — The International Journal on Very Large Data Bases
A survey of data provenance in e-science
ACM SIGMOD Record
Pattern Recognition and Machine Learning (Information Science and Statistics)
Pattern Recognition and Machine Learning (Information Science and Statistics)
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Automatic capture and efficient storage of e-Science experiment provenance
Concurrency and Computation: Practice & Experience - The First Provenance Challenge
Analysis of Computer and Communication Networks
Analysis of Computer and Communication Networks
Provenance in Sensornet Republishing
Provenance and Annotation of Data and Processes
Facilitating fine grained data provenance using temporal data model
Proceedings of the Seventh International Workshop on Data Management for Sensor Networks
LIVE: a lineage-supported versioned DBMS
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II
Adaptive Inference of Fine-grained Data Provenance to Achieve High Accuracy at Lower Storage Costs
ESCIENCE '11 Proceedings of the 2011 IEEE Seventh International Conference on eScience
ProvenanceCurious: a tool to infer data provenance from scripts
Proceedings of the 16th International Conference on Extending Database Technology
Hi-index | 0.00 |
Many applications facilitate a data processing chain, i.e. a workflow, to process data. Results of intermediate processing steps may not be persistent since reproducing these results are not costly and these are hardly re-usable. However, in stream data processing where data arrives continuously, documenting fine-grained provenance explicitly for a processing chain to reproduce results is not a feasible solution since the provenance data may become a multiple of the actual sensor data. In this paper, we propose the multi-step provenance inference technique that infers provenance data for the entire workflow with non-materialized intermediate views. Our solution provides high quality provenance graph.