What makes the differences: benchmarking XML database implementations

Authors:
Hongjun Lu;Jeffrey Xu Yu;Guoren Wang;Shihui Zheng;Haifeng Jiang;Ge Yu;Aoying Zhou
Affiliations:
The Hong Kong University of Science and Technology, Hong Kong, China;The Chinese University of Hong Kong, Hong Kong, China;Northeastern University, Shenyang, China;Fudan University, Shanghai, China;The Hong Kong University of Science and Technology, Hong Kong, China;Northeastern University, Shenyang, China;Fudan University, Shanghai, China
Venue:
ACM Transactions on Internet Technology (TOIT)
Year:
2005

Citing 31
Cited 10

From structured documents to novel query facilities

SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Lore: a database management system for semistructured data

ACM SIGMOD Record
Catching the boat with Strudel: experiences with a Web-site management system

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Storing semistructured data with STORED

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
A query language for XML

WWW '99 Proceedings of the eighth international conference on World Wide Web
XML-GL: a graphical language for querying and restructuring XML documents

WWW '99 Proceedings of the eighth international conference on World Wide Web
Comparative analysis of five XML query languages

ACM SIGMOD Record
Comparative analysis of six XML schema languages

ACM SIGMOD Record
On supporting containment queries in relational database management systems

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
XRel: a path-based approach to storage and retrieval of XML documents using relational databases

ACM Transactions on Internet Technology (TOIT)
Path materialization revisited: an efficient storage model for XML data

ADC '02 Proceedings of the 13th Australasian database conference - Volume 5
Accelerating XPath location steps

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
XBase: making your gigabyte disk queriable

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
An XML Indexing Structure with Relative Region Coordinate

Proceedings of the 17th International Conference on Data Engineering
Relational Databases for Querying XML Documents: Limitations and Opportunities

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Indexing and Querying XML Data for Regular Path Expressions

Proceedings of the 27th International Conference on Very Large Data Bases
VXMLR: A Visual XML-Relational Database System

Proceedings of the 27th International Conference on Very Large Data Bases
Performance Evaluation of a DOM-Based XML Database: Storage, Indexing and Query Optimization

WAIM '02 Proceedings of the Third International Conference on Advances in Web-Age Information Management
XBench - A Family of Benchmarks for XML DBMSs

Proceedings of the VLDB 2002 Workshop EEXTT and CAiSE 2002 Workshop DTWeb on Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web-Revised Papers
Multi-user Evaluation of XML Data Management Systems with XMach-1

Proceedings of the VLDB 2002 Workshop EEXTT and CAiSE 2002 Workshop DTWeb on Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web-Revised Papers
The XOO7 Benchmark

Proceedings of the VLDB 2002 Workshop EEXTT and CAiSE 2002 Workshop DTWeb on Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web-Revised Papers
The Michigan Benchmark: A Microbenchmark for XML Query Processing Systems

Proceedings of the VLDB 2002 Workshop EEXTT and CAiSE 2002 Workshop DTWeb on Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web-Revised Papers
Efficient Relational Storage and Retrieval of XML Documents

Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
Quilt: An XML Query Language for Heterogeneous Data Sources

Selected papers from the Third International Workshop WebDB 2000 on The World Wide Web and Databases
XMach-1: A Benchmark for XML Data Management

Datenbanksysteme in Büro, Technik und Wissenschaft (BTW), 9. GI-Fachtagung,
Efficient Storage of XML Data

ICDE '00 Proceedings of the 16th International Conference on Data Engineering
The XML benchmark project

The XML benchmark project
XParent: An Efficient RDBMS-Based XML Database System

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Structural Joins: A Primitive for Efficient XML Query Pattern Matching

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Efficient structural joins on indexed XML documents

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
XMark: a benchmark for XML data management

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases

A document object modeling method to retrieve data from a very large XML document

Proceedings of the 2007 ACM symposium on Document engineering
Automaton segmentation: a new approach to preserve privacy in xml information brokering

Proceedings of the 14th ACM conference on Computer and communications security
Extending path summary and region encoding for efficient structural query processing in native XML databases

Journal of Systems and Software
Bitmap indexes for relational XML twig query processing

Proceedings of the 18th ACM conference on Information and knowledge management
Optimizing updates of recursive XML views of relations

The VLDB Journal — The International Journal on Very Large Data Bases
Which XML storage for knowledge and ontology systems?

KES'10 Proceedings of the 14th international conference on Knowledge-based and intelligent information and engineering systems: Part I
Benchmarking database representations of RDF/S stores

ISWC'05 Proceedings of the 4th international conference on The Semantic Web
A tale of two approaches: query performance study of XML storage strategies in relational databases

DEXA'06 Proceedings of the 17th international conference on Database and Expert Systems Applications
OXONE: a scalable solution for detecting superior quality deltas on ordered large xml documents

ER'06 Proceedings of the 25th international conference on Conceptual Modeling
Pragmatic XML access control using off-the-shelf RDBMS

ESORICS'07 Proceedings of the 12th European conference on Research in Computer Security

Quantified Score

Hi-index	0.00

Visualization

Abstract

XML is emerging as a major standard for representing data on the World Wide Web. Recently, many XML storage models have been proposed to manage XML data. In order to assess an XML database's abilities to deal with XML queries, several benchmarks have also been proposed, including XMark and XMach. However, no reported studies using those benchmarks were found that can provide users with insights on the impacts of a variety of storage models on XML query performance. In this article, we report our first set of results on benchmarking a set of XML database implementations using two XML benchmarks. The selected implementations represent a wide range of approaches, including RDBMS-based systems with document-independent and document-dependent XML-relational schema mapping approaches, and XML native engines based on an Object-Oriented Model and the Document Object Model. Comprehensive experiments were conducted to study relative performance of different approaches and the important issues that affect XML query performance, such as path expression query processing, effectiveness of various partitioning, label-path, and indexing structures.