Graph-based modeling of ETL activities with multi-level transformations and updates

  • Authors:
  • Alkis Simitsis;Panos Vassiliadis;Manolis Terrovitis;Spiros Skiadopoulos

  • Affiliations:
  • Dept. of Electrical and Computer Eng., National Technical University of Athens, Athens, Hellas;Dept. of Computer Science, University of Ioannina, Ioannina, Hellas;Dept. of Electrical and Computer Eng., National Technical University of Athens, Athens, Hellas;Dept. of Electrical and Computer Eng., National Technical University of Athens, Athens, Hellas

  • Venue:
  • DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2005

Abstract

Extract-Transform-Load (ETL) workflows are data-centric workflows responsible for transferring, cleaning, and loading data from their respective sources to the warehouse. In this paper, we build upon existing graph-based modeling techniques that treat ETL workflows as graphs by (a) extending the activity semantics to incorporate negation, aggregation, and self-joins, (b) complementing querying semantics with insertions, deletions, and updates, and (c) transforming the graph to allow zoom-in/out at multiple levels of abstraction (i.e., passing from the detailed description of the graph at the attribute level to more compact variants involving programs, relations, and queries, and vice versa).
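
To make point (c) more concrete, the following is a minimal Python sketch of one way an attribute-level graph could be collapsed to a coarser view and expanded again. It is an illustration under stated assumptions, not the authors' algorithm: the relation names (S, DW), the activity name (NotNull), the attribute names, and the functions zoom_out and zoom_in are all hypothetical.

```python
from collections import defaultdict

# Hypothetical attribute-level graph. Each attribute belongs to exactly one
# container: a source relation (S), an activity (NotNull), or a target (DW).
parent = {
    "S.pkey": "S", "S.cost": "S",
    "NotNull.in_pkey": "NotNull", "NotNull.in_cost": "NotNull",
    "NotNull.out_pkey": "NotNull", "NotNull.out_cost": "NotNull",
    "DW.pkey": "DW", "DW.cost": "DW",
}

# Attribute-to-attribute edges; data flows from the first node to the second.
attribute_edges = [
    ("S.pkey", "NotNull.in_pkey"),
    ("S.cost", "NotNull.in_cost"),
    ("NotNull.in_pkey", "NotNull.out_pkey"),
    ("NotNull.in_cost", "NotNull.out_cost"),
    ("NotNull.out_pkey", "DW.pkey"),
    ("NotNull.out_cost", "DW.cost"),
]


def zoom_out(edges, parent):
    """Collapse attribute nodes into their containers.

    Returns a dict mapping each container-level edge to the list of
    attribute-level edges it summarizes, so the detail stays recoverable.
    """
    summary = defaultdict(list)
    for src, dst in edges:
        p_src, p_dst = parent[src], parent[dst]
        if p_src != p_dst:  # edges internal to one container are hidden
            summary[(p_src, p_dst)].append((src, dst))
    return dict(summary)


def zoom_in(summary, container_edge):
    """Return the attribute-level edges behind one container-level edge."""
    return summary[container_edge]


coarse = zoom_out(attribute_edges, parent)
for container_edge in coarse:
    print(container_edge, "summarizes", len(coarse[container_edge]), "edges")
```

In this sketch the coarse view keeps a back-reference to the attribute-level edges it hides, so moving between levels loses no information; that bookkeeping is a design choice of the example rather than something stated in the abstract.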