Accuracy of aggregate data in distributed project settings: Model, analysis and implications

  • Authors:
  • Nitin R. Joglekar;Edward G. Anderson;G. Shankaranarayanan

  • Affiliations:
  • Boston University, Boston, MA;University of Texas at Austin, Austin, TX;Babson College, Babson Park, MA

  • Venue:
  • Journal of Data and Information Quality (JDIQ)
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We examine the management of data accuracy in inter-organizational data exchanges using the context of distributed software projects. Organizations typically manage projects by outsourcing portions of the project to partners. Managing a portfolio of such projects requires sharing data regarding the status of work-in-progress residing with the partners and estimates of these projects' completion times. Portfolio managers use these data to assign projects to be outsourced to partners. These data are rarely accurate. Unless these data are filtered, inaccuracies can lead to myopic and expensive sourcing decisions. We develop a model that uses project-status data to identify an optimal assignment of projects to be outsourced. This model permits corruption of project-status data. We use this model to compute the costs of using perfect versus inaccurate project-status data and show that the costs of deviation from optimal are sizable when the inaccuracy in the data is significant. We further propose a filter to correct inaccurate project-status data and generate an estimate of true progress. With this filter, depending on the relative magnitudes of errors, we show that accuracy of project-status data can be improved and the associated economic benefit is significant. We illustrate the improvement in accuracy and associated economic benefit by instantiating the model and the filter. We further elaborate on how the model parameters may be estimated and used in practice.