Consistent data for inconsistent XML document

  • Authors:
  • Zijing Tan;Zijun Zhang;Wei Wang;Baile Shi

  • Affiliations:
  • Department of Computing and Information Technology, University of Fudan, Shanghai 200433, P.R. China;Department of Computing and Information Technology, University of Fudan, Shanghai 200433, P.R. China;Department of Computing and Information Technology, University of Fudan, Shanghai 200433, P.R. China;Department of Computing and Information Technology, University of Fudan, Shanghai 200433, P.R. China

  • Venue:
  • Information and Software Technology
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

XML document may contain inconsistencies that violate predefined integrity constraints, which causes the data inconsistency problem. In this paper, we consider how to get the consistent data from an inconsistent XML document. There are two basic concepts for this problem: Repair is the data consistent with the integrity constraints, and also minimally differs from the original one. Consistent data is the data common for every possible repair. First we give a general constraint model for XML, which can express the commonly discussed integrity constraints, including functional dependencies, keys and multivalued dependencies. Next we provide a repair framework for inconsistent XML document with three basic update operations: node insertion, node deletion and node value modification. Following this approach, we introduce the concept of repair for inconsistent XML document, discuss the chase method to generate repairs, and prove some important properties of the chase. Finally we give a method to obtain the greatest lower bound of all possible repairs, which is sufficient for consistent data. We also implement prototypes of our method, and evaluate our framework and algorithms in the experiment.