COVER model pivot view indexing for efficient XML data management

  • Authors: Fuesane Cheng; Yelena Yesha

  • Affiliations: University of Maryland, Baltimore, MD; University of Maryland, Baltimore, MD

  • Venue: Proceedings of the 2010 Conference of the Center for Advanced Studies on Collaborative Research
  • Year: 2010

Abstract

To help businesses compete on speed to market in product and service development, generically modeled data structures are widely used in vertical application software systems and for storing XML and RDF data, because of their flexibility, adaptability, and agility. However, generic data models require multiple self-joins on a single large table, which slows business intelligence (BI) applications. Shredded XML data stored in traditional specific data models performs faster, but those models are not flexible, adaptive, or agile enough for speed to market. A generic data model named the Class Object Value Element Relationship (COVER) model was developed for storing node-oriented tree data; it is suitable for automated pivot view generation and distributed data processing. The approach uses pivot views with appropriate metadata constructs to expose search predicate fields for indexing. This yields performance gains when retrieving data from branches or leaves across multiple trees, whether for production support or to feed business intelligence and data mining. Benchmark experiments compared query performance on the COVER model against self-join and XPath/XQuery approaches in an RDBMS; the COVER model outperformed the other two on the same sets of test data queries.
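
The contrast between the self-join and pivot view query shapes can be illustrated with a minimal sketch. It does not reproduce the COVER schema, which the abstract does not specify: the table `node_value`, the view `product_pivot`, and the sample rows are all hypothetical, and SQLite is used only because it ships with Python. SQLite views cannot be indexed directly, so the sketch shows only how a pivot view exposes predicate fields as columns; in an RDBMS with indexed or materialized views, those columns could then carry indexes.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Hypothetical generic (one row per element value) table; NOT the COVER schema.
CREATE TABLE node_value (
    object_id INTEGER,
    element   TEXT,
    value     TEXT
);
INSERT INTO node_value VALUES
    (1, 'name', 'Widget'), (1, 'color', 'red'),  (1, 'price', '10'),
    (2, 'name', 'Gadget'), (2, 'color', 'blue'), (2, 'price', '25'),
    (3, 'name', 'Gizmo'),  (3, 'color', 'red'),  (3, 'price', '40');

-- Hypothetical pivot view: one row per object, one column per element.
CREATE VIEW product_pivot AS
SELECT object_id,
       MAX(CASE WHEN element = 'name'  THEN value END) AS name,
       MAX(CASE WHEN element = 'color' THEN value END) AS color,
       MAX(CASE WHEN element = 'price' THEN value END) AS price
FROM node_value
GROUP BY object_id;
""")

# Generic model queried directly: one self-join per additional element.
SELF_JOIN_SQL = """
SELECT n.value, p.value
FROM node_value AS c
JOIN node_value AS n ON n.object_id = c.object_id AND n.element = 'name'
JOIN node_value AS p ON p.object_id = c.object_id AND p.element = 'price'
WHERE c.element = 'color' AND c.value = 'red'
ORDER BY n.value
"""

# Same question against the pivot view: the predicate field is a plain column.
PIVOT_SQL = """
SELECT name, price FROM product_pivot WHERE color = 'red' ORDER BY name
"""

self_join_rows = conn.execute(SELF_JOIN_SQL).fetchall()
pivot_rows = conn.execute(PIVOT_SQL).fetchall()
print(self_join_rows == pivot_rows)
```

Both queries return the same rows here; the sketch says nothing about relative performance, which depends on the database engine and on how the pivot view is stored and indexed.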