On the efficient processing regular path expressions of an enormous volume of XML data

Authors:
Michal Krátký;Radim Bača;Václav Snášel
Affiliations:
Department of Computer Science, VŠB - Technical University of Ostrava, Ostrava-Poruba, Czech Republic;Department of Computer Science, VŠB - Technical University of Ostrava, Ostrava-Poruba, Czech Republic;Department of Computer Science, VŠB - Technical University of Ostrava, Ostrava-Poruba, Czech Republic
Venue:
DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Year:
2007

Abstract

XML (Extensible Markup Language) has recently been embraced as a new approach to data modeling. More and more information is now formatted as semi-structured data, e.g. articles in a digital library or documents on the web. Implementing an efficient system for storing and querying XML documents requires the development of new techniques. Indexing an XML document enables efficient evaluation of a user query. XML query languages such as XPath and XQuery use a form of path expressions to compose more general queries. Current approaches to indexing XML data do not evaluate regular path expressions efficiently enough: most of them index single elements and process the query statement by joining individual expressions. In this article we introduce an approach that makes it possible to efficiently process a query defined by regular path expressions. The approach indexes all root-to-leaf paths and stores them in multi-dimensional data structures, which allows the indexing and efficient querying of an enormous volume of XML data.
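To make the idea of indexing root-to-leaf paths as multi-dimensional points more concrete, the following is a minimal Python sketch and not the authors' implementation. It assumes that each path can be encoded as a fixed-length tuple of integer tag ids (padded with 0), and that a simple path query with a `*` wildcard can be expressed as a range box over those tuples. The function names (`root_to_leaf_paths`, `encode_paths`, `query_box`, `range_query`) are hypothetical, and a linear scan over a list stands in for a real multi-dimensional data structure.

```python
import xml.etree.ElementTree as ET


def root_to_leaf_paths(xml_text):
    """Return every root-to-leaf path as a tuple of element tags."""
    root = ET.fromstring(xml_text)
    paths = []

    def walk(node, prefix):
        current = prefix + (node.tag,)
        children = list(node)
        if not children:          # a leaf element closes one path
            paths.append(current)
        for child in children:
            walk(child, current)

    walk(root, ())
    return paths


def encode_paths(paths):
    """Assumed encoding: tags become integer ids, paths become padded points."""
    tag_ids = {}
    for path in paths:
        for tag in path:
            tag_ids.setdefault(tag, len(tag_ids) + 1)  # id 0 is reserved for padding
    width = max(len(p) for p in paths)
    points = [
        tuple(tag_ids[t] for t in p) + (0,) * (width - len(p))
        for p in paths
    ]
    return tag_ids, points


def query_box(tag_ids, query, width):
    """Turn a path query into one (low, high) interval per dimension."""
    max_id = max(tag_ids.values())
    box = []
    for step in query:
        if step == "*":
            box.append((1, max_id))       # any real tag, but not padding
        else:
            tag_id = tag_ids.get(step, -1)  # unknown tag gives an empty interval
            box.append((tag_id, tag_id))
    box += [(0, max_id)] * (width - len(query))  # unconstrained trailing dimensions
    return box


def range_query(points, box):
    """Linear scan standing in for a range query on a multi-dimensional index."""
    return [
        p for p in points
        if len(p) == len(box)
        and all(lo <= coord <= hi for coord, (lo, hi) in zip(p, box))
    ]


doc = "<lib><book><title/><author/></book><article><title/></article></lib>"
paths = root_to_leaf_paths(doc)
tag_ids, points = encode_paths(paths)
box = query_box(tag_ids, ("lib", "*", "title"), width=3)
matches = range_query(points, box)
```

In this sketch a whole path is matched by a single range query rather than by joining per-element results, which is the contrast the abstract draws. Descendant steps, element content, and the choice of the actual index structure are left out.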