Development process of the operational version of PDQM

  • Authors:
  • Angélica Caro; Coral Calero; Mario Piattini

  • Affiliations:
  • Department of Computer Science and Information Technologies, University of Bio Bio, Chillán, Chile; Alarcos Research Group, Information Systems and Technologies Department, INDRA Research and Development Institute, University of Castilla-La Mancha; Alarcos Research Group, Information Systems and Technologies Department, INDRA Research and Development Institute, University of Castilla-La Mancha

  • Venue:
  • WISE'07 Proceedings of the 8th international conference on Web information systems engineering
  • Year:
  • 2007

Abstract

PDQM is a data quality model for web portals. The model is centered on the data consumer's perspective, and we built it through a process divided into two parts. In the first part we defined the theoretical version of PDQM, which yielded a set of 33 data quality attributes that can be used to evaluate data quality in portals. The second part consisted of converting PDQM into an operational model, for which we adopted a probabilistic approach based on Bayesian networks. In this paper we present the development of this second part, which was divided into four phases: (1) definition of a criterion to organize PDQM's attributes, (2) generation of a Bayesian network to represent PDQM, (3) definition of measures and node probability tables for the Bayesian network, and (4) validation of PDQM.
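
To make the role of node probability tables more concrete, the following is a minimal sketch of how a Bayesian network node could combine two measured attributes into a quality level. It is not the PDQM network itself: the attribute names (`accuracy`, `timeliness`), the two-state discretization, every probability value, and the function `prob_quality_high` are hypothetical and chosen only for illustration.

```python
from itertools import product

# Hypothetical prior distributions for two measured attributes.
# The numbers are illustrative only and do not come from the paper.
priors = {
    "accuracy": {"low": 0.3, "high": 0.7},
    "timeliness": {"low": 0.4, "high": 0.6},
}

# Hypothetical node probability table:
# P(quality = "high" | accuracy, timeliness).
npt_quality_high = {
    ("low", "low"): 0.1,
    ("low", "high"): 0.4,
    ("high", "low"): 0.5,
    ("high", "high"): 0.9,
}


def prob_quality_high(evidence=None):
    """Return P(quality = high | evidence) by enumerating parent states.

    `evidence` maps a parent name to its observed state; unobserved
    parents are marginalized out using their priors.
    """
    evidence = evidence or {}
    total = 0.0
    for acc, tim in product(priors["accuracy"], priors["timeliness"]):
        weight = 1.0
        for name, state in (("accuracy", acc), ("timeliness", tim)):
            if name in evidence:
                # An observed parent keeps only the matching state.
                weight *= 1.0 if evidence[name] == state else 0.0
            else:
                weight *= priors[name][state]
        total += weight * npt_quality_high[(acc, tim)]
    return total


if __name__ == "__main__":
    print(prob_quality_high())                      # no measurements yet
    print(prob_quality_high({"accuracy": "high"}))  # accuracy measured as high
```

In this sketch a measure supplies evidence for a parent node, and the table propagates that evidence to the quality node. A full network would have more attributes and intermediate nodes, but the calculation at each node follows the same pattern.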