Multiresolution Indexing of XML for Frequent Queries

Authors:
Hao He;Jun Yang
Venue:
ICDE '04 Proceedings of the 20th International Conference on Data Engineering
Year:
2004



Abstract

XML and other types of semi-structured data are typically represented by a labeled directed graph. To speed up path expression queries over the graph, a variety of structural indexes have been proposed. They usually work by partitioning nodes in the data graph into equivalence classes and storing equivalence classes as index nodes. A(k)-index introduces the concept of local bisimilarity for partitioning, allowing the trade-off between index size and query answering power. However, all index nodes in A(k)-index have the same local similarity k, which cannot take advantage of the fact that a workload may contain path expressions of different lengths, or that different parts of the data graph may have different local similarity requirements. To overcome these limitations, we propose M(k)- and M*(k)-indexes. The basic M(k)-index is workload-aware: Like the previously proposed D(k)-index, it allows different index nodes to have different local similarity requirements, providing finer partitioning only for parts of the data graph targeted by longer path expressions. Unlike D(k)-index, M(k)-index is never over-refined for irrelevant index or data nodes. However, the workload-aware feature still incurs over-refinement due to over-qualified parent index nodes. Moreover, fine partitions penalize the performance of short path expressions. To solve these problems, we further propose the M*(k)-index. An M*(k)-index consists of a collection of indexes whose nodes are organized in a partition hierarchy, allowing successively coarser partitioning information to co-exist with the finest partitioning information required. Experiments show that our indexes are superior to previously proposed indexes in terms of index size and query performance.
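To make the idea of partitioning by local bisimilarity concrete, here is a minimal sketch of the uniform-k refinement that the abstract attributes to the A(k)-index. It is not the M(k)- or M*(k)-index construction, which the abstract does not spell out. The function name `k_bisimulation_partition`, the graph encoding (a label dictionary plus an edge list), and the toy data are assumptions made for illustration. Two nodes are treated as 0-bisimilar when they share a label, and as k-bisimilar when they are (k-1)-bisimilar and their parents fall into the same (k-1)-bisimilarity classes.

```python
from collections import defaultdict


def k_bisimulation_partition(labels, edges, k):
    """Group nodes into k-bisimilarity classes (illustrative sketch).

    labels: dict mapping node id -> label (hypothetical encoding)
    edges:  iterable of (parent, child) pairs
    k:      local similarity, i.e. the number of refinement rounds
    """
    parents = defaultdict(set)
    for src, dst in edges:
        parents[dst].add(src)

    # Round 0: nodes with the same label share a block.
    block = dict(labels)

    for _ in range(k):
        ids = {}
        refined = {}
        for node in labels:
            # A node's signature is its current block plus the set of
            # blocks its parents currently belong to.
            parent_blocks = frozenset(block[p] for p in parents.get(node, ()))
            signature = (block[node], parent_blocks)
            # Renumber signatures to small integers to keep them compact.
            refined[node] = ids.setdefault(signature, len(ids))
        block = refined

    classes = defaultdict(set)
    for node, b in block.items():
        classes[b].add(node)
    return sorted(sorted(members) for members in classes.values())


# Toy data graph: nodes 4 and 5 share a label but have differently
# labeled parents.
labels = {1: "root", 2: "a", 3: "b", 4: "c", 5: "c"}
edges = [(1, 2), (1, 3), (2, 4), (3, 5)]

print(k_bisimulation_partition(labels, edges, 0))
print(k_bisimulation_partition(labels, edges, 1))
```

With k = 0 the two "c" nodes share one index node. With k = 1 they are split because their parents carry different labels, which illustrates the trade-off between index size and query answering power: a larger k yields more index nodes but supports longer path expressions.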