XML schema containment checking based on semi-implicit techniques

  • Authors:
  • Akihiko Tozawa;Masami Hagiya

  • Affiliations:
  • IBM Research, Tokyo Research Laboratory, IBM Japan ltd., Japan;Graduate School of Information Science and Technology, University of Tokyo, Japan

  • Venue:
  • CIAA'03 Proceedings of the 8th international conference on Implementation and application of automata
  • Year:
  • 2003

Quantified Score

Hi-index 0.01

Visualization

Abstract

XML schemas are computer languages defining grammars for XML (Extensible Markup Languages) documents. Containment checking for XML schemas has many applications, and is thus important. Since XML schemas are related to the class of tree regular languages, their containment checking is reduced to the language containment problem for non-deterministic tree automata (NTAs). However, an NTA for a practical XML schema has 102-103 states for which the textbook algorithm based on naive determinization is expensive. Thus we in this paper consider techniques based on BDDs (binary decision diagrams). We used semi-implicit encoding which encodes a set of subsets of states as a BDD, rather than encoding a set of states by it. The experiment on several real-world XML schemas proves that our containment checker can answer problems that cannot be solved by previously known algorithms.