Reconstructing provenance

  • Authors:
  • Sara Magliacane

  • Affiliations:
  • Department of Computer Science, VU University, Amsterdam, The Netherlands

  • Venue:
  • ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part II
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Provenance is an increasingly important aspect of data management that is often underestimated and neglected by practitioners. In our work, we target the problem of reconstructing provenance of files in a shared folder setting, assuming that only standard filesystem metadata are available. We propose a content-based approach that is able to reconstruct provenance automatically, leveraging several similarity measures and edit distance algorithms, adapting and integrating them into a multi-signal pipeline. We discuss our research methodology and show some promising preliminary results.