Indexing Semistructured Data Using PATRICIA Tree

  • Authors:
  • Li-Cheng Wu;Jorng-Tzong Horng;Baw-Jhiune Liu;Chin-Yea Wang;Gwo-Dong Chen

  • Affiliations:
  • -;-;-;-;-

  • Venue:
  • DEXA '00 Proceedings of the 11th International Conference on Database and Expert Systems Applications
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

Information on the Web like HTML documents with images, video, and sound is a collection of heterogeneous data. HTML documents are semistructured in nature. Semistructured data are used to describe those structures which are less rigid or regular than those data found in standard database systems. This study presents a novel means of using Patricia Tree [14] to index semistructured data. This index is used by transferring the query into a regular expression and querying the regular expression over the Patricia Tree. The highlight of this approach is supporting query on content and structure simultaneously, while also supporting fast query time on long path and regular expressions.