Print: a provenance model to support integration processes

  • Authors:
  • Bruno Tomazela;Carmem S. Hara;Ricardo R. Ciferri;Cristina D.A. Ciferri

  • Affiliations:
  • University of São Paulo, São Carlos, Brazil;Federal University of Paraná, Curitiba, Brazil;Federal University of São Carlos, São Carlos, Brazil;University of São Paulo, São Carlos, Brazil

  • Venue:
  • CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In some integration applications, users are allowed to import data from heterogeneous sources, but are not allowed to update source data directly. Imported data may be inconsistent, and even when inconsistencies are detected and solved, these changes may not be propagated to the sources due to their update policies. Therefore, they continue to provide the same inconsistent data in the future until the proper authority updates them. In this paper, we propose PrInt, a model that supports user's decisions on cleaning data to be automatically reapplied in subsequent integration processes. By reproducing previous decisions, the user may focus only on new inconsistencies originated from source modified data. The reproducibility provided by PrInt is based on logging, and by incorporating data provenance in the integration process.