Schema versioning in data warehouses: enabling cross-version querying via schema augmentation

  • Authors:
  • Matteo Golfarelli;Jens Lechtenbörger;Stefano Rizzi;Gottfried Vossen

  • Affiliations:
  • DEIS, University of Bologna, Italy;Department of Information Systems, University of Muenster, Muenster, Germany;DEIS, University of Bologna, Italy;Department of Information Systems, University of Muenster, Muenster, Germany

  • Venue:
  • Data & Knowledge Engineering - Special issue: WIDM 2004
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

As several mature implementations of data warehousing systems are fully operational, a crucial role in preserving their up-to-dateness is played by the ability to manage the changes that the data warehouse (DW) schema undergoes over time in response to evolving business requirements. In this paper we propose an approach to schema versioning in DWs, where the designer may decide to undertake some actions on old data aimed at increasing the flexibility in formulating cross-version queries, i.e., queries spanning multiple schema versions. First, we introduce a representation of DW schemata as graphs of simple functional dependencies, and discuss its properties. Then, after defining an algebra of schema graph modification operations aimed at creating new schema versions, we discuss how augmented schemata can be introduced to increase flexibility in cross-version querying. Next, we show how a history of versions for DW schemata is managed and discuss the relationship between the temporal horizon spanned by a query and the schema on which it can consistently be formulated.