Integrating ETL processes from information requirements

  • Authors:
  • Petar Jovanovic; Oscar Romero; Alkis Simitsis; Alberto Abelló

  • Affiliations:
  • BarcelonaTech, Universitat Politècnica de Catalunya, Barcelona, Spain; BarcelonaTech, Universitat Politècnica de Catalunya, Barcelona, Spain; HP Labs, Palo Alto, CA; BarcelonaTech, Universitat Politècnica de Catalunya, Barcelona, Spain

  • Venue:
  • DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2012

Abstract

Data warehouse (DW) design is based on a set of requirements expressed as service level agreements (SLAs) and business level objects (BLOs). Populating a DW system from a set of information sources is realized with extract-transform-load (ETL) processes based on SLAs and BLOs. The entire task is complex, time-consuming, and hard to perform manually. This paper presents our approach to the requirement-driven creation of ETL designs. Each requirement is considered separately, and a corresponding ETL design is produced for it. We propose an incremental method for consolidating these individual designs into a single ETL design that satisfies all given requirements. Finally, the resulting design is sent to an ETL engine for execution. We illustrate our approach through an example based on TPC-H and report experimental findings that show the effectiveness and quality of our approach.