XMark: a benchmark for XML data management

  • Authors:
  • Albrecht Schmidt;Florian Waas;Martin Kersten;Michael J. Carey;Ioana Manolescu;Ralph Busse

  • Affiliations:
  • CWI, Kruislaan, GB, Amsterdam, The Netherlands;Microsoft Corporation, Redmond;CWI, Kruislaan, GB Amsterdam, The Netherlands;BEA Systems, Inc.,;INRIA-Rocquencourt, Le Chesnay Cedex, France;FHG-IPSI, Darmstadt, Germany

  • Venue:
  • VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

While standardization efforts for XML query languages have been progressing, researchers and users increasingly focus on the database technology that has to deliver on the new challenges that the abundance of XML documents poses to data management: validation, performance evaluation and optimization of XML query processors are the upcoming issues. Following a long tradition in database research, we provide a framework to assess the abilities of an XML database to cope with a broad range of different query types typically encountered in real-world scenarios. The benchmark can help both implementors and users to compare XML databases in a standardized application scenario. To this end, we offer a set of queries where each query is intended to challenge a particular aspect of the query processor. The overall workload we propose consists of a scalable document database and a concise, yet comprehensive set of queries which covers the major aspects of XML query processing ranging from textual features to data analysis queries and ad hoc queries. We complement our research with results we obtained from running the benchmark on several XML database platforms. These results are intended to give a first baseline and illustrate the state of the art.