The Michigan benchmark: towards XML query performance diagnostics

Authors:
Kanda Runapongsa;Jignesh M. Patel;H. V. Jagadish;Yun Chen;Shurug Al-Khalifa
Affiliations:
Department of Electrical Engineering and Computer Science, University of Michigan, 1301 Beal Avenue, Ann Arbor, MI 48109-2122, USA;Department of Electrical Engineering and Computer Science, University of Michigan, 1301 Beal Avenue, Ann Arbor, MI 48109-2122, USA;Department of Electrical Engineering and Computer Science, University of Michigan, 1301 Beal Avenue, Ann Arbor, MI 48109-2122, USA;Department of Electrical Engineering and Computer Science, University of Michigan, 1301 Beal Avenue, Ann Arbor, MI 48109-2122, USA;Department of Electrical Engineering and Computer Science, University of Michigan, 1301 Beal Avenue, Ann Arbor, MI 48109-2122, USA
Venue:
Information Systems
Year:
2006

Citing 18
Cited 5

Random number generators: good ones are hard to find

Communications of the ACM
The 007 Benchmark

SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Shoring up persistent applications

SIGMOD '94 Proceedings of the 1994 ACM SIGMOD international conference on Management of data
Lore: a database management system for semistructured data

ACM SIGMOD Record
Efficient evaluation of XML middle-ware queries

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Microsoft SQL Server 2000 Resource Kit

Microsoft SQL Server 2000 Resource Kit
Why and how to benchmark XML databases

ACM SIGMOD Record
Current Approaches to XML Management

IEEE Internet Computing
Tamino - A DBMS designed for XML

Proceedings of the 17th International Conference on Data Engineering
Relational Databases for Querying XML Documents: Limitations and Opportunities

VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Querying XML Views of Relational Data

Proceedings of the 27th International Conference on Very Large Data Bases
Multi-user Evaluation of XML Data Management Systems with XMach-1

Proceedings of the VLDB 2002 Workshop EEXTT and CAiSE 2002 Workshop DTWeb on Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web-Revised Papers
XMach-1: A Benchmark for XML Data Management

Datenbanksysteme in Büro, Technik und Wissenschaft (BTW), 9. GI-Fachtagung,
Efficient XML Data Management: An Analysis

EC-WEB '02 Proceedings of the Third International Conference on E-Commerce and Web Technologies
Efficiently publishing relational data as XML documents

The VLDB Journal — The International Journal on Very Large Data Bases
Structural Joins: A Primitive for Efficient XML Query Pattern Matching

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
XBench Benchmark and Performance Testing of XML DBMSs

ICDE '04 Proceedings of the 20th International Conference on Data Engineering
XMark: a benchmark for XML data management

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases

Generation of synthetic XML for evaluation of hybrid XML systems

DASFAA'10 Proceedings of the 15th international conference on Database systems for advanced applications
Efficiently querying XML documents stored in RDBMS in the presence of Dewey-based labeling scheme

ACIIDS'10 Proceedings of the Second international conference on Intelligent information and database systems: Part I
XWeB: the XML warehouse benchmark

TPCTC'10 Proceedings of the Second TPC technology conference on Performance evaluation, measurement and characterization of complex systems
s-XML: An efficient mapping scheme to bridge XML and relational database

Knowledge-Based Systems
Count-Constraints for generating XML

NGITS'06 Proceedings of the 6th international conference on Next Generation Information Technologies and Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a micro-benchmark for XML data management to aid engineers in designing improved XML processing engines. This benchmark is inherently different from application-level benchmarks, which are designed to help users choose between alternative products. We primarily attempt to capture the rich variety of data structures and distributions possible in XML, and to isolate their effects, without imitating any particular application. The benchmark specifies a single data set against which carefully specified queries can be used to evaluate system performance for XML data with various characteristics. We have used the benchmark to analyze the performance of three database systems: two native XML DBMSs, and a commercial ORDBMS. The benchmark reveals key strengths and weaknesses of these systems. We find that commercial relational techniques are effective for XML query processing in many cases, but are sensitive to query rewriting, and require better support for efficiently determining indirect structural containment. In addition, the benchmark also played an important role in helping the development team of Timber (our native XML DBMS) devise a more effective access method, and fine tune the implementation of the structural join algorithms.