Compile-time minimisation of load imbalance in loop nests
ICS '97 Proceedings of the 11th international conference on Supercomputing
On supporting containment queries in relational database management systems
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
XOO7: applying OO7 benchmark to XML query processing tool
Proceedings of the tenth international conference on Information and knowledge management
Holistic twig joins: optimal XML pattern matching
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Parallel Processing XML Documents
IDEAS '02 Proceedings of the 2002 International Symposium on Database Engineering & Applications
Structural Joins: A Primitive for Efficient XML Query Pattern Matching
ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Introduction to Data Mining, (First Edition)
Introduction to Data Mining, (First Edition)
WIN: An E.cient Data Placement Strategy for Parallel XML Databases
ICPADS '05 Proceedings of the 11th International Conference on Parallel and Distributed Systems - Volume 01
Processing XPath Queries in PC-Clusters Using XML Data Partitioning
ICDEW '06 Proceedings of the 22nd International Conference on Data Engineering Workshops
XMark: a benchmark for XML data management
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Querying XML Data using PC Cluster System
DEXA '07 Proceedings of the 18th International Conference on Database and Expert Systems Applications
XML data partitioning strategies to improve parallelism in parallel holistic twig joins
Proceedings of the 3rd International Conference on Ubiquitous Information Management and Communication
Executing parallel TwigStack algorithm on a multi-core system
Proceedings of the 11th International Conference on Information Integration and Web-based Applications & Services
Distributed SLCA-based XML keyword search by map-reduce
DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications
Hi-index | 0.00 |
As traditional partitioning strategies do not serve well for semistructured data, partitioning and distributing heterogeneous XML documents onto a parallel cluster system have lead to such an intricacy issue for maintaining good query processing performance. In this paper, we propose a grid metadata model for XML that gives a conceptual view to partition XML data, specifically for holistic twig joins processing. The proposed model adopts a cost-based model and facilitates a set of partition refinement methods for workload balancing purpose. The model has features of reducing the workload variance significantly on the cluster system, duplicating XML data necessarily to avoid data dependency among cluster nodes, and exploiting inter query parallelism and intra query parallelism. We evaluate the effectiveness of our proposed model in the experiment that our data partitioning method has better workload balance and has an impact on better parallel speed up performance as well.