Managerial decision support with knowledge of accuracy and completeness of the relational aggregate functions

  • Authors:
  • Amir Parssian

  • Affiliations:
  • University of Illinois at Springfield, One University Plaza, Springfield

  • Venue:
  • Decision Support Systems
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Aggregate data produced by decision support systems is utilized by managers in their decision making process to run or improve their firm's operations. Often, data residing in corporate databases and data warehouses are far from being perfect, and their imperfections have an impact on decision quality and outcome. Therefore, having knowledge about the effect of data errors on aggregate data could lead to more informed decisions, reduced risks, and competitive advantage. In this paper, we present a methodology to estimate the effects of data accuracy and completeness, as two important data quality dimensions, on the relational aggregate functions Count, Sum, Average, Max, and Min. Our methodology defines a set of attribute value types and deploys sampling strategies to determine the maximum likelihood estimates of each value type. We show the effect of data error rates on the scalar values returned by the aggregate functions and demonstrate the efficiency of our estimates by Monte Carlo simulations.