Conceptual modeling for ETL processes

  • Authors:
  • Panos Vassiliadis; Alkis Simitsis; Spiros Skiadopoulos

  • Affiliations:
  • National Technical University of Athens, Athens, Greece; National Technical University of Athens, Athens, Greece; National Technical University of Athens, Athens, Greece

  • Venue:
  • Proceedings of the 5th ACM international workshop on Data Warehousing and OLAP
  • Year:
  • 2002

Abstract

Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for extracting data from several sources, cleansing and customizing it, and inserting it into a data warehouse. In this paper, we focus on the problem of defining ETL activities and provide formal foundations for their conceptual representation. The proposed conceptual model is (a) customized for tracing inter-attribute relationships and the respective ETL activities in the early stages of a data warehouse project; (b) enriched with a 'palette' of frequently used ETL activities, such as the assignment of surrogate keys and the check for null values; and (c) constructed in a customizable and extensible manner, so that designers can enrich it with their own recurring patterns for ETL activities.
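
To make points (b) and (c) concrete, the following is a minimal Python sketch of what an extensible palette of ETL activities could look like. It is not the paper's formal model: the names `PALETTE`, `register`, `not_null`, `surrogate_key`, and `run_pipeline`, as well as the sample attributes `cust_id` and `cust_sk`, are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, Hashable, List, Tuple

Row = Dict[str, object]
Activity = Callable[[List[Row]], List[Row]]

# Hypothetical registry standing in for the 'palette' of reusable activities.
PALETTE: Dict[str, Callable[..., Activity]] = {}


def register(name: str):
    """Add an activity factory to the palette under the given name."""
    def decorator(factory: Callable[..., Activity]):
        PALETTE[name] = factory
        return factory
    return decorator


@register("not_null")
def not_null(attribute: str) -> Activity:
    """Null check: drop rows whose `attribute` is missing or None."""
    def run(rows: List[Row]) -> List[Row]:
        return [r for r in rows if r.get(attribute) is not None]
    return run


@register("surrogate_key")
def surrogate_key(source_attr: str, target_attr: str) -> Activity:
    """Surrogate key assignment: map each distinct source key to an integer."""
    lookup: Dict[Hashable, int] = {}

    def run(rows: List[Row]) -> List[Row]:
        out = []
        for r in rows:
            key = r[source_attr]
            if key not in lookup:
                lookup[key] = len(lookup) + 1  # next unused surrogate value
            out.append({**r, target_attr: lookup[key]})
        return out
    return run


def run_pipeline(rows: List[Row], steps: List[Tuple[str, dict]]) -> List[Row]:
    """Apply the named palette activities in order."""
    for name, kwargs in steps:
        rows = PALETTE[name](**kwargs)(rows)
    return rows


# Illustrative source data (made up for this sketch).
source_rows = [
    {"cust_id": "A17", "city": "Athens"},
    {"cust_id": None, "city": "Patras"},
    {"cust_id": "B02", "city": "Athens"},
    {"cust_id": "A17", "city": "Athens"},
]

loaded = run_pipeline(source_rows, [
    ("not_null", {"attribute": "cust_id"}),
    ("surrogate_key", {"source_attr": "cust_id", "target_attr": "cust_sk"}),
])
```

In this sketch a designer would extend the palette by decorating a new factory with `register(...)`, which loosely mirrors the idea of adding one's own recurring patterns without changing the existing activities.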