Assessing Data Quality for Information Products: Impact of Selection, Projection, and Cartesian Product

  • Authors:
  • Amir Parssian;Sumit Sarkar;Varghese S. Jacob

  • Affiliations:
  • -;-;-

  • Venue:
  • Management Science
  • Year:
  • 2004

Quantified Score

Hi-index 0.01

Visualization

Abstract

The cost associated with making decisions based on poor-quality data is quite high. Consequently, the management of data quality and the quality of associated data management processes has become critical for organizations. An important first step in managing data quality is the ability to measure the quality of information products (derived data) based on the quality of the source data and associated processes used to produce the information outputs. We present a methodology to determine two data quality characteristics--accuracy and completeness--that are of critical importance to decision makers. We examine how the quality metrics of source data affect the quality for information outputs produced using the relational algebra operations selection, projection, and Cartesian product. Our methodology is general, and can be used to determine how quality characteristics associated with diverse data sources affect the quality of the derived data.