Efficient structural joins on indexed XML documents

Authors:
Shu-Yao Chien;Zografoula Vagena;Donghui Zhang;Vassilis J. Tsotras;Carlo Zaniolo
Affiliations:
CS Dept., UCLA;CES Dept., UC Riverside;CES Dept., UC Riverside;CES Dept., UC Riverside;CS Dept., UCLA
Venue:
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Year:
2002

Citing 31
Cited 123

Two algorithms for maintaining order in a list

STOC '87 Proceedings of the nineteenth annual ACM symposium on Theory of computing
Access methods for multiversion data

SIGMOD '89 Proceedings of the 1989 ACM SIGMOD international conference on Management of data
The R*-tree: an efficient and robust access method for points and rectangles

SIGMOD '90 Proceedings of the 1990 ACM SIGMOD international conference on Management of data
Efficient processing of spatial joins using R-trees

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
The snapshot index: an I/O-optimal access method for timeslice queries

Information Systems
Change detection in hierarchically structured information

SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
A query language for XML

WWW '99 Proceedings of the eighth international conference on World Wide Web
XML-GL: a graphical language for querying and restructuring XML documents

WWW '99 Proceedings of the eighth international conference on World Wide Web
Comparison of access methods for time-evolving data

ACM Computing Surveys (CSUR)
XTRACT: a system for extracting document type descriptors from XML documents

SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
On supporting containment queries in relational database management systems

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Accelerating XPath location steps

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Holistic twig joins: optimal XML pattern matching

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
R-trees: a dynamic index structure for spatial searching

SIGMOD '84 Proceedings of the 1984 ACM SIGMOD international conference on Management of data
An Efficient Multiversion Access Structure

IEEE Transactions on Knowledge and Data Engineering
Designing Access Methods for Bitemporal Databases

IEEE Transactions on Knowledge and Data Engineering
Representative Objects: Concise Representations of Semistructured, Hierarchial Data

ICDE '97 Proceedings of the Thirteenth International Conference on Data Engineering
Optimizing Regular Path Expressions Using Graph Schemas

ICDE '98 Proceedings of the Fourteenth International Conference on Data Engineering
Spatial Joins Using R-trees: Breadth-First Traversal with Global Optimizations

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Query Optimization for XML

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Relational Databases for Querying XML Documents: Limitations and Opportunities

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Indexing and Querying XML Data for Regular Path Expressions

Proceedings of the 27th International Conference on Very Large Data Bases
A Fast Index for Semistructured Data

Proceedings of the 27th International Conference on Very Large Data Bases
MV3R-Tree: A Spatio-Temporal Access Method for Timestamp and Interval Queries

Proceedings of the 27th International Conference on Very Large Data Bases
Efficient Management of Multiversion Documents by Object Referencing

Proceedings of the 27th International Conference on Very Large Data Bases
Quilt: An XML Query Language for Heterogeneous Data Sources

Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Efficient Complex Query Support for Multiversion XML Documents

EDBT '02 Proceedings of the 8th International Conference on Extending Database Technology: Advances in Database Technology
An asymptotically optimal multiversion B-tree

The VLDB Journal — The International Journal on Very Large Data Bases
Structural Joins: A Primitive for Efficient XML Query Pattern Matching

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Exploiting Local Similarity for Indexing Paths in Graph-Structured Data

ICDE '02 Proceedings of the 18th International Conference on Data Engineering

On labeling schemes for the semantic web

WWW '03 Proceedings of the 12th international conference on World Wide Web
Efficient management of XML documents

ADC '03 Proceedings of the 14th Australasian database conference - Volume 17
Querying structured text in an XML database

Proceedings of the 2003 ACM SIGMOD international conference on Management of data
PRIX: Indexing And Querying XML Using Prüfer Sequences

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
BLAS: an efficient XPath processing system

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Efficient processing of XML twig queries with OR-predicates

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
On the integration of structure indexes and inverted lists

SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Efficient structural joins with clustered extents

Information Processing Letters - Devoted to the rapid publication of short contributions to information processing
Twig query processing over graph-structured XML data

Proceedings of the 7th International Workshop on the Web and Databases: colocated with ACM SIGMOD/PODS 2004
A Fast Tree Pattern Matching Algorithm for XML Query

WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
EXPedite: a system for encoded XML processing

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Virtual cursors for XML joins

Proceedings of the thirteenth ACM international conference on Information and knowledge management
An evaluation of XML indexes for structural join

ACM SIGMOD Record
What makes the differences: benchmarking XML database implementations

ACM Transactions on Internet Technology (TOIT)
Efficient structural joins with on-the-fly indexing

WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Lazy XML updates: laziness as a virtue, of update and structural join efficiency

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Querying and maintaining ordered XML data using relational databases

ADC '05 Proceedings of the 16th Australasian database conference - Volume 39
Tree-pattern queries on a lightweight XML processor

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Accelerating queries by pruning XML documents

Data & Knowledge Engineering
A path-based node filtering method for efficient structural joins

Information Processing Letters
Efficient indexing and querying of XML data using modified Prüfer sequences

Proceedings of the 14th ACM international conference on Information and knowledge management
Optimizing cursor movement in holistic twig joins

Proceedings of the 14th ACM international conference on Information and knowledge management
Supporting complex queries on multiversion XML documents

ACM Transactions on Internet Technology (TOIT)
Sequencing XML data and query twigs for fast pattern matching

ACM Transactions on Database Systems (TODS)
Integrating document and data retrieval based on XML

The VLDB Journal — The International Journal on Very Large Data Bases
TWIX: twig structure and content matching of selective queries using binary labeling

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Query optimization in XML structured-document databases

The VLDB Journal — The International Journal on Very Large Data Bases
Indexing graph-structured XML data for efficient structural join operation

Data & Knowledge Engineering
Indexing XML documents for XPath query processing in external memory

Data & Knowledge Engineering - Special issue: ER 2003
An efficient infrastructure for native transactional XML processing

Data & Knowledge Engineering
XML subtree reconstruction from relational storage of XML documents

Data & Knowledge Engineering
Indexing dataspaces

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Xpath on steroids: exploiting relational engines for xpath performance

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Adaptive Index Utilization in Memory-Resident Structural Joins

IEEE Transactions on Knowledge and Data Engineering
The dynamic predicate: integrating access control with query processing in XML databases

The VLDB Journal — The International Journal on Very Large Data Bases
Evaluating Universal Quantification in XML

IEEE Transactions on Knowledge and Data Engineering
Efficiently Querying Large XML Data Repositories: A Survey

IEEE Transactions on Knowledge and Data Engineering
Structural join and staircase join algorithms of sibling relationship

Journal of Computer Science and Technology
Projecting XML documents

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Mixed mode XML query processing

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Holistic twig joins on indexed XML documents

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Staircase join: teach a relational DBMS to watch its (axis) steps

VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Dynamic interval-based labeling scheme for efficient XML query and update processing

Journal of Systems and Software
Schema-free XQuery

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
XQuery on SQL hosts

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Evaluation of partial path queries on xml data

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
LCS-TRIM: dynamic programming meets XML indexing and querying

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Sum-max monotonic ranked joins for evaluating top-k twig queries on weighted data graphs

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Temporal queries and version management in XML-based document archives

Data & Knowledge Engineering
Efficient updates in dynamic XML data: from binary string to quaternary string

The VLDB Journal — The International Journal on Very Large Data Bases
Enabling Schema-Free XQuery with meaningful query focus

The VLDB Journal — The International Journal on Very Large Data Bases
An efficient numbering scheme and query algorithms for XML

International Journal of Computational Science and Engineering
Efficient evaluation of generalized path pattern queries on XML data

Proceedings of the 17th international conference on World Wide Web
Faster path indexes for search in XML data

ADC '08 Proceedings of the nineteenth conference on Australasian database - Volume 75
Computing structural similarity of source XML schemas against domain XML schema

ADC '08 Proceedings of the nineteenth conference on Australasian database - Volume 75
Warehousing complex data from the web

International Journal of Web Engineering and Technology
An Efficient XML Index Structure with Bottom-Up Query Processing

ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
A Glance on Current XML Twig Pattern Matching Algorithms

ICCSA '08 Proceedings of the international conference on Computational Science and Its Applications, Part II
Coding-based Join Algorithms for Structural Queries on Graph-Structured XML Document

World Wide Web
Hash-base subgraph query processing method for graph-structured XML documents

Proceedings of the VLDB Endowment
S3: Evaluation of tree-pattern XML queries supported by structural summaries

Data & Knowledge Engineering
RRSi: indexing XML data for proximity twig queries

Knowledge and Information Systems
Indexing Method for Transitive Relationships of Product Information

WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
An efficient XML encoding and labeling method for query processing and updating on dynamic XML data

Journal of Systems and Software
FMware: middleware for efficient filtering and matching of XML messages with local data

Proceedings of the ACM/IFIP/USENIX 2006 International Conference on Middleware
Query Optimization for Complex Path Queries on XML Data

DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
Cost based plan selection for xpath

Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Answering XML queries using materialized views revisited

Proceedings of the 18th ACM conference on Information and knowledge management
A bi-labeling based XPath processing system

Information Systems
Optimizing updates of recursive XML views of relations

The VLDB Journal — The International Journal on Very Large Data Bases
Principles of Holism for sequential twig pattern matching

The VLDB Journal — The International Journal on Very Large Data Bases
Processing independent and inter-linked documents in XML databases

IRI'09 Proceedings of the 10th IEEE international conference on Information Reuse & Integration
BPI: XML query evaluation using bitmapped path indices

Proceedings of the 2009 EDBT/ICDT Workshops
Algorithmic Design Using Object-Z for Twig XML Queries Evaluation

Electronic Notes in Theoretical Computer Science (ENTCS)
A path-based node filtering method for efficient structural joins

Information Processing Letters
Efficient physical operators for cost-based XPath execution

Proceedings of the 13th International Conference on Extending Database Technology
Effective pruning for XML structural match queries

Data & Knowledge Engineering
A path-based approach for efficient structural join with not-predicates

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Efficient integration of structure indexes of XML

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Efficient holistic twig joins in leaf-to-root combining with root-to-leaf way

DASFAA'07 Proceedings of the 12th international conference on Database systems for advanced applications
Exploring XML web collections with DescribeX

ACM Transactions on the Web (TWEB)
XML: some papers in a haystack

ACM SIGMOD Record
Jumping over useless nodes in Xml tree pattern with level based index

ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
Evaluation Techniques for Generalized Path Pattern Queries on XML Data

World Wide Web
Towards unifying advances in twig join algorithms

ADC '10 Proceedings of the Twenty-First Australasian Conference on Database Technologies - Volume 104
A syntactic approach to twig-query matching on XML streams

Journal of Systems and Software
A survey on XML keyword search

APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications
Database and information retrieval techniques for XML

ASIAN'05 Proceedings of the 10th Asian Computing Science conference on Advances in computer science: data management on the web
Efficient evaluation of sibling relationship in Xpath queries

ASIAN'05 Proceedings of the 10th Asian Computing Science conference on Advances in computer science: data management on the web
Labeling scheme and structural joins for graph-structured XML data

APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
FMware: middleware for efficient filtering and matching of XML messages with local data

Middleware'06 Proceedings of the 7th ACM/IFIP/USENIX international conference on Middleware
Exploit sequencing to accelerate XML twig query answering

DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Fast reachability query processing

DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Fast structural join with a location function

DASFAA'06 Proceedings of the 11th international conference on Database Systems for Advanced Applications
Using XML structure to reduce candidate nodes participated in query processing

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
Subgraph join: efficient processing subgraph queries on graph-structured XML document

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
S3: Processing tree-pattern XML queries with all logical operators

Data & Knowledge Engineering
An efficient index scheme for XML databases

SOFSEM'06 Proceedings of the 32nd conference on Current Trends in Theory and Practice of Computer Science
Efficient evaluation of XML twig queries

APWeb'06 Proceedings of the 2006 international conference on Advanced Web and Network Technologies, and Applications
Integration of a structural index with a structural join for accelerating path queries

ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part II
Fragment indexing of XML-based metadata for digital TV contents

ICAT'06 Proceedings of the 16th international conference on Advances in Artificial Reality and Tele-Existence
An efficient parallel pathstack algorithm for processing XML twig queries on multi-core systems

DASFAA'10 Proceedings of the 15th international conference on Database Systems for Advanced Applications - Volume Part I
Accelerating XML structural join by partitioning

DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
SIOUX: an efficient index for processing structural XQueries

DEXA'05 Proceedings of the 16th international conference on Database and Expert Systems Applications
On-demand index for efficient structural joins

WAIM '06 Proceedings of the 7th international conference on Advances in Web-Age Information Management
Efficiently coding and querying XML document

DNIS'05 Proceedings of the 4th international conference on Databases in Networked Information Systems
HID: an efficient path index for complex XML collections with arbitrary links

DNIS'05 Proceedings of the 4th international conference on Databases in Networked Information Systems
Multiple schema based XML indexing

ICCNMC'05 Proceedings of the Third international conference on Networking and Mobile Computing
Efficiently coding and indexing XML document

DASFAA'05 Proceedings of the 10th international conference on Database Systems for Advanced Applications
A path-based labeling scheme for efficient structural join

XSym'05 Proceedings of the Third international conference on Database and XML Technologies
The BIRD numbering scheme for XML and tree databases – deciding and reconstructing tree relations using efficient arithmetic operations

XSym'05 Proceedings of the Third international conference on Database and XML Technologies
Relational index support for XPath axes

XSym'05 Proceedings of the Third international conference on Database and XML Technologies
Index vs. navigation in XPath evaluation

XSym'06 Proceedings of the 4th international conference on Database and XML Technologies
FLUX: content and structure matching of XPath queries with range predicates

XSym'06 Proceedings of the 4th international conference on Database and XML Technologies
A resource efficient hybrid data structure for twig queries

XSym'06 Proceedings of the 4th international conference on Database and XML Technologies
An efficient branch query rewriting algorithm for XML query optimization

OTM'05 Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, COA, and ODBASE - Volume Part II
Hash-based structural join algorithms

EDBT'06 Proceedings of the 2006 international conference on Current Trends in Database Technology
Advanced structural joins using element distribution

Information Sciences: an International Journal
Examining the impact of data-access cost on XML twig pattern matching

Information Sciences: an International Journal
Efficient evaluation of partial path queries over a XML compact storage structure

Proceedings of the International Conference on Advances in Computing, Communications and Informatics
Efficient probabilistic XML query processing using an extended labeling scheme and a lightweight index

Information Processing and Management: an International Journal
Optimizing XML queries: Bitmapped materialized views vs. indexes

Information Systems
Temporal and multi-versioned XML documents: A survey

Information Processing and Management: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Queries on XML documents typically combine selections on element contents, and, via path expressions, the structural relationships between tagged elements. Structural joins are used to find all pairs of elements satisfying the primitive structural relationships specified in the query, namely, parent-child and ancestor-descendant relationships. Efficient support for structural joins is thus the key to efficient implementations of XML queries. Recently proposed node numbering schemes enable the capturing of the XML document structure using traditional indices (such as B+-trees or R-trees). This paper proposes efficient structural join algorithms in the presence of tag indices. We first concentrate on using B+- trees and show how to expedite a structural join by avoiding collections of elements that do not participate in the join. We then introduce an enhancement (based on sibling pointers) that further improves performance. Such sibling pointers are easily implemented and dynamically maintainable. We also present a structural join algorithm that utilizes R-trees. An extensive experimental comparison shows that the B+-tree structural joins are more robust. Furthermore, they provide drastic improvement gains over the current state of the art.