A High Performance Schema-Specific XML Parser

  • Authors:
  • Zhenghong Gao;Yinfei Pan;Ying Zhang;Kenneth Chiu

  • Affiliations:
  • -;-;-;-

  • Venue:
  • E-SCIENCE '07 Proceedings of the Third IEEE International Conference on e-Science and Grid Computing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Performance of XML parsers with validation are usually suffer. This is because such parsers need first parsing and undertanding XML schemas, and thus are limited by the very complexity of XML schemas. Schema-specific approach, however, may adjust such problem. In this paper, we introduce a high performance SAX like validating XML parser using a schema-specific approach. In this approach, a schema compiler first transforms the schema into an intermediate representation, called generalized automata, which abstracts the computations required to parse XML documents as well as validate them against a schema. The generalized automaton is then translated to a schema specific parser, which is capable of parsing and validating XML documents with namespaces through a schema specific modified SAX API. Our performance evaluation shows good results when compared with other validating parsers.