Data quality inference

  • Authors:
  • Raymond K. Pon;Alfonso F. Cárdenas

  • Affiliations:
  • UCLA Computer Science, Los Angeles, CA;UCLA Computer Science, Los Angeles, CA

  • Venue:
  • Proceedings of the 2nd international workshop on Information quality in information systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the field of sensor networks, data integration and collaboration, and intelligence gathering efforts, information on the quality of data sources are important but are often not available. We describe a technique to rank data sources by observing and comparing their behavior (i.e., the data produced by data sources) to rank. Intuitively, our measure characterizes data sources that agree with accurate or high-quality data sources as likely accurate. Furthermore, our measure includes a temporal component that takes into account a data source's past accuracy in evaluating its current accuracy. Initial experimental results based on simulation data to support our hypothesis demonstrate high precision and recall on identifying the most accurate data sources.