Enforcing strictness in integration of dimensions: beyond instance matching

  • Authors:
  • Dariush Riazati;James A. Thom;Xiuzhen Zhang

  • Affiliations:
  • RMIT University, Melbourne, Australia;RMIT University, Melbourne, Australia;RMIT University, Melbourne, Australia

  • Venue:
  • Proceedings of the ACM 14th international workshop on Data Warehousing and OLAP
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Maintaining strictness in dimensions is important in integration of data warehouses. A dimension that satisfies all of its roll-up constraints is said to be strict, a property that is required for correct aggregation. Existing work on instance matching does not address the problem of enforcing the strictness of roll-up constraints. In this paper, we use a graph matching-based approach to dimension instance matching and propose an algorithm that enforces strictness and reduces false positives. Making use of similarity flooding, the graph matching algorithm can be greedy in identifying matching members, we propose heuristics to further reduce false positive matches and reduce false strictness. Experiments on real-world data demonstrates the effectiveness of our proposed approach.