Measuring referential integrity in distributed databases

  • Authors:
  • Carlos Ordonez;Javier García-García;Zhibo Chen

  • Affiliations:
  • University of Houston, Houston, TX;UNAM University, Mexico City, Mexico;University of Houston, Houston, TX

  • Venue:
  • Proceedings of the ACM first workshop on CyberInfrastructure: information management in eScience
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Distributed relational databases are used by different organizations located at multiple sites that work together on common projects. In this article, we focus on distributed relational databases with incomplete and inconsistent content. We propose to measure referential integrity errors in them for integration and interoperability purposes. We propose local and global referential integrity metrics at three levels: column, table and database. We assume each table can be asynchronously updated at any site and new records are periodically broadcasted to all sites. We explain several distributed query optimization issues. Our proposal is useful in database integration, multiple database interoperability and data quality assurance. We discuss applications of our proposal in distributed scientific databases.