LOP: capturing and linking open provenance on LOD cycle

  • Authors:
  • Rogers Reiche de Mendonça; Sérgio Manuel Serra da Cruz; Jonas F. S. M. De La Cerda; Maria Cláudia Cavalcanti; Kelli Faria Cordeiro; Maria Luiza M. Campos

  • Affiliations:
  • Universidade Federal do Rio de Janeiro (UFRJ); Universidade Federal Rural do Rio de Janeiro (UFRRJ); Instituto Militar de Engenharia (IME); Instituto Militar de Engenharia (IME); Universidade Federal do Rio de Janeiro (UFRJ); Universidade Federal do Rio de Janeiro (UFRJ)

  • Venue:
  • Proceedings of the Fifth Workshop on Semantic Web Information Management
  • Year:
  • 2013

Abstract

The Web of Data has emerged as a means to expose, share, reuse, and connect information on the Web, identified by URIs and using RDF as a data model, following the Linked Data principles. However, the reuse of third-party data can be compromised without proper data quality assessments. In this context, important questions emerge: How can one trust published data and links? Which manipulation, modification, and integration operations were applied to the data before its publication? What is the nature of the comparisons or transformations applied to the data during the interlinking process? In this scenario, provenance becomes a fundamental element. In this paper, we describe an approach for generating and capturing Linked Open Provenance (LOP) to support data quality and trustworthiness assessments. The approach covers the cycle from the preparation and format transformation of traditional data sources up to dataset publication and interlinking. The proposed architecture uses provenance agents, orchestrated by an ETL workflow, to collect provenance at any specified level and to link it with its corresponding data. We also describe a real use case in which the architecture was implemented to evaluate the proposal.