Inferring fine-grained data provenance in stream data processing: reduced storage cost, high accuracy

  • Authors:
  • Mohammad Rezwanul Huq;Andreas Wombacher;Peter M. G. Apers

  • Affiliations:
  • University of Twente, The Netherlands;University of Twente, The Netherlands;University of Twente, The Netherlands

  • Venue:
  • DEXA'11 Proceedings of the 22nd international conference on Database and expert systems applications - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Fine-grained data provenance ensures reproducibility of results in decision making, process control and e-science applications. However, maintaining this provenance is challenging in stream data processing because of its massive storage consumption, especially with large overlapping sliding windows. In this paper, we propose an approach to infer fine-grained data provenance by using a temporal data model and coarse-grained data provenance of the processing. The approach has been evaluated on a real dataset and the result shows that our proposed inferring method provides provenance information as accurate as explicit fine-grained provenance at reduced storage consumption.