An automatic load/extract scheme for XML documents through object-relational repositories

  • Authors:
  • Frank S. C. Tseng;Wen-Jong Hwung

  • Affiliations:
  • Department of Information Management, National Kaohsiung First University of Science and Technology, Kaohsiung 824, Taiwan, ROC;Department of Information Management, National Kaohsiung First University of Science and Technology, Kaohsiung 824, Taiwan, ROC

  • Venue:
  • Journal of Systems and Software
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Extensible markup language (XML), a simplified version of standard generalized markup language (SGML), is designed to enable electronic text interchange in the Internet. XML documents have a rigorously described structure that may be analyzed by computers and easily understood by humans. Most current approaches store XML documents in file systems or in relational database systems. However, the nature and the design of file system or relational database schema may cause limitations on fitting with XML document structure. In this paper, we present an automatic load/extract scheme to store and retrieve XML documents through object-relational databases. We propose an architecture, called XML meta-generator (XMG), which, after reading a specific document type definition (DTD), automatically generates the corresponding object-relational database schema (OR-Schema), a DI-Decomposer and a DI-Reconstructor, which are explained as follows: 1. OR-Schema--an object-relational database schema in UniSQL/X format for a specific DTD. 2. DI-Decomposer--a module decomposes XML document instances (Dis) according to the specific DTD format and stores the elements into the corresponding object-relational database. 3. DI-Reconstructor--a module retrieves elements from the object-relational database and reconstructs it to recover the original DI.These modules make XML documents be automatically decomposed into and reconstructed from object-relational databases in a seamless manner. Moreover, documents stored in the object-relational databases can be managed and inquired more easily than it could be in file systems or relational databases. Useful applications on various documents can also be easily built on top of the target database, such as digital libraries, data warehouses, and data or text mining systems.