Taxonomy of XML schema languages using formal language theory

  • Authors:
  • Makoto Murata;Dongwon Lee;Murali Mani;Kohsuke Kawaguchi

  • Affiliations:
  • IBM Tokyo Research Lab, Kanagawa-ken, Japan;Penn State University;Worcester Polytechnic Institute;Sun Microsystems

  • Venue:
  • ACM Transactions on Internet Technology (TOIT)
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

On the basis of regular tree grammars, we present a formal framework for XML schema languages. This framework helps to describe, compare, and implement such schema languages in a rigorous manner. Our main results are as follows: (1) a simple framework to study three classes of tree languages (local, single-type, and regular); (2) classification and comparison of schema languages (DTD, W3C XML Schema, and RELAX NG) based on these classes; (3) efficient document validation algorithms for these classes; and (4) other grammatical concepts and advanced validation algorithms relevant to an XML model (e.g., binarization, derivative-based validation).