Query generation for retrieving data from distributed semistructured documents using a metadata interface

  • Authors:
  • Guija Choe; Young-Kwang Nam; Joseph Goguen; Guilian Wang

  • Affiliations:
  • Department of Computer Science, Yonsei University, Wonju, Korea; Department of Computer Science, Yonsei University, Wonju, Korea; Department of Computer Science and Engineering, University of California at San Diego, USA; Department of Computer Science and Engineering, University of California at San Diego, USA

  • Venue:
  • Computer Languages, Systems and Structures
  • Year:
  • 2009

Abstract

We describe a method for generating queries that retrieve data from distributed, heterogeneous semistructured documents, and its implementation in the metadata interface DDXMI (distributed document XML metadata interchange). Given a user query over the global schema, the proposed system generates local queries appropriate to each local schema. To do so, it constructs mappings between the global schema and the local schemas (which are extracted from the local documents if not given), and it uses path substitution and node identification to resolve the heterogeneity among nodes with the same label, which often occur in semistructured data. The system uses Quilt as its XML query language. An experiment is reported over three local semistructured documents ('thesis', 'reports', and 'journal') with an 'article' global schema. The prototype was developed under Windows using Java and JavaCC.
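The path-substitution idea in the abstract can be illustrated with a minimal sketch. It assumes a mapping table from global-schema paths to local-schema paths for each source; the table layout, the element names under each source, and the helper names `substitute_path` and `local_paths` are all hypothetical and are not taken from the DDXMI file format. The sketch rewrites paths only and does not generate Quilt queries or perform node identification.

```python
# Hypothetical mapping table: global-schema path -> local-schema path, per source.
# All element names are invented for illustration; this is not the DDXMI format.
MAPPINGS = {
    "thesis": {
        "/article/title": "/thesis/head/title",
        "/article/author": "/thesis/head/writer",
    },
    "reports": {
        "/article/title": "/reports/report/name",
        "/article/author": "/reports/report/author",
    },
    "journal": {
        "/article/title": "/journal/paper/title",
    },
}


def substitute_path(global_path, mapping):
    """Rewrite a global path using its longest mapped prefix, or return None."""
    steps = global_path.strip("/").split("/")
    # Try the full path first, then progressively shorter prefixes.
    for cut in range(len(steps), 0, -1):
        prefix = "/" + "/".join(steps[:cut])
        if prefix in mapping:
            rest = steps[cut:]
            return "/".join([mapping[prefix]] + rest)
    return None


def local_paths(global_path):
    """Return {source: local path} for every source that maps the global path."""
    result = {}
    for source, mapping in MAPPINGS.items():
        local = substitute_path(global_path, mapping)
        if local is not None:
            result[source] = local
    return result
```

In this sketch, a source with no mapping for the requested path (here, 'journal' for the author path) is simply left out of the result, so no local query would be generated for it.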