In this paper, we present a document model that integrates the logical structure and the hypertext link structure of hyperdocuments, so that structured documents with hypertext links can be managed. Based on this model, we define a new structure query language that expresses structure queries using path expressions. In a document management system that represents structure information as database relations, processing a structure query requires costly join operations to find the relationships between elements in the document hierarchy. To overcome this problem, schemes based on the parse tree and the element locator have been used. We propose a new structure query processing scheme that uses unique element identifiers (UIDs) to evaluate structure queries. Our scheme has an advantage over previous schemes because the UIDs of a node's ancestors and descendants can be obtained directly from the node's own UID, without disk access. We present relational database schemas for our scheme as well as for the others and compare their query processing costs. To support direct access to a document element, keyword indices on elements must be provided. We propose three kinds of inverted index structures for efficient structure query processing.
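The abstract does not say how UIDs are assigned. As a minimal sketch of how ancestor and descendant UIDs could be derived by arithmetic alone, assume the document hierarchy is embedded in a complete k-ary tree and numbered in level order, with the root as UID 1. The fan-out `k` and the function names `parent`, `child`, `ancestors` and `is_ancestor` are illustrative assumptions, not the paper's schema.

```python
from typing import List, Optional


def parent(uid: int, k: int) -> Optional[int]:
    """Return the parent's UID, or None for the root (UID 1).

    Assumes level-order numbering of a complete k-ary tree.
    """
    if uid < 1 or k < 1:
        raise ValueError("uid and k must be positive")
    if uid == 1:
        return None
    return (uid - 2) // k + 1


def child(uid: int, j: int, k: int) -> int:
    """Return the UID of the j-th child (1-based) of node `uid`."""
    if not 1 <= j <= k:
        raise ValueError("j must be between 1 and k")
    return k * (uid - 1) + j + 1


def ancestors(uid: int, k: int) -> List[int]:
    """Return all ancestor UIDs, nearest first, using arithmetic only."""
    result = []
    p = parent(uid, k)
    while p is not None:
        result.append(p)
        p = parent(p, k)
    return result


def is_ancestor(a: int, d: int, k: int) -> bool:
    """True if node `a` is a proper ancestor of node `d`."""
    return a in ancestors(d, k)


if __name__ == "__main__":
    k = 3  # hypothetical maximum fan-out of the document tree
    print(ancestors(13, k))       # ancestors of node 13, nearest first
    print(child(2, 3, k))         # third child of node 2
    print(is_ancestor(2, 7, k))   # structural test without any join
```

Under this numbering, a containment test between two elements becomes a short loop over integers instead of a join over a parent-child relation, which is the kind of saving the abstract claims. The trade-off is that `k` must be at least the largest fan-out in the document, so sparse trees leave many UIDs unused.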