Modeling data warehouse schema evolution over extended hierarchy semantics

  • Authors:
  • Sandipto Banerjee;Karen C. Davis

  • Affiliations:
  • MicroStrategy, Inc., 1861 International Drive, McLean, VA;Electrical & Computer Engineering Dept., University of Cincinnati, Cincinnati, OH

  • Venue:
  • Journal on Data Semantics XIII
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Models for conceptual design of data warehouse schemas have been proposed, but few researchers have addressed schema evolution in a formal way and none have presented software tools for enforcing the correctness of multidimensional schema evolution operators. We generalize the core features typically found in data warehouse data models, along with modeling extended hierarchy semantics. The advanced features include multiple hierarchies, non-covering hierarchies, non-onto hierarchies, and non-strict hierarchies. We model the constructs in the Uni-level Description Language (ULD) as well as using a multilevel dictionary definition (MDD) approach. The ULD representation provides a formal foundation to specify transformation rules for the semantics of schema evolution operators. The MDD gives a basis for direct implementation in a relational database system; we define model constraints and then use the constraints to maintain integrity when schema evolution operators are applied. This paper contributes a formalism for representing data warehouse schemas and determining the validity of schema evolution operators applied to a schema. We describe a software tool that allows for visualization of the impact of schema evolution through the use of triggers and stored procedures.