Efficient processing of XPath queries using indexes

Authors:
Sanjay Madria;Yan Chen;Kalpdrum Passi;Sourav Bhowmick
Affiliations:
Department of Computer Science, University of Missouri-Rolla, Rolla, MO 65409, USA;Department of Computer Science, University of Missouri-Rolla, Rolla, MO 65409, USA;Department of Math & CS, Laurentian University, Sudbury, Ont., Canada;School of Computer Engineering, Nanyang Technological University, Singapore 639798, Singapore
Venue:
Information Systems
Year:
2007

Abstract

A number of indexing techniques have recently been proposed for optimizing queries on XML and other semi-structured data models. Most semi-structured models use tree-like structures and query languages (XPath, XQuery, etc.) that rely on regular path expressions to optimize query processing. In this paper, we propose two algorithms, the Entry-point algorithm (EPA) and the Two-point Entry algorithm, that exploit different types of indices to process XPath queries efficiently. We discuss and compare two approaches to implementing the EPA, namely Root-first and Bottom-first. We present experimental results for the algorithms on XML benchmark queries and data, and compare them with those of traditional query processing methods, with and without indexes, and with the ToXin indexing approach. Our algorithms perform better than both the traditional methods and the ToXin indexing approach.