Extract-Transform-Load (ETL) activities are software modules responsible for populating a data warehouse with operational data, which have undergone a series of transformations on their way to the warehouse. The whole process is very complex and of significant importance for the design and maintenance of the data warehouse. A plethora of commercial ETL tools are already available in the market. However, each one of them follows a different approach for the modeling of ETL activities, i.e., of the building blocks of an ETL workflow. As a result, so far there is no standard or unified approach for describing such activities. In this paper, we work towards the identification of generic properties that characterize ETL activities. In doing so, we follow a black-box approach: we provide a taxonomy that characterizes ETL activities in terms of the relationship of their input to their output, and a normal form that is based on interpreted semantics for the black-box activities. Finally, we show how the proposed taxonomy can be used in the construction of larger modules, i.e., ETL archetype patterns, which can be used for the composition and optimization of ETL workflows.
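To make the black-box idea concrete, the following minimal sketch labels an activity purely by how its output relates to its input on a sample, without inspecting the activity's internals. The function `classify`, the three labels, and the sample activities are hypothetical names chosen for illustration; they are not the categories or the normal form defined in the paper, and an empirical check on one sample is only a rough stand-in for a semantic characterization.

```python
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]
Activity = Callable[[List[Row]], List[Row]]


def classify(activity: Activity, sample: List[Row]) -> str:
    """Label a black-box activity by comparing its output to its input.

    Hypothetical labels (assumptions for illustration only):
      "row-level"   : one output row per input row
      "filter"      : fewer rows, each of them an unchanged input row
      "aggregation" : fewer rows, at least one not present in the input
      "expansion"   : more output rows than input rows
    """
    output = activity(sample)
    if len(output) == len(sample):
        return "row-level"
    if len(output) > len(sample):
        return "expansion"
    if all(row in sample for row in output):
        return "filter"
    return "aggregation"


# Three hypothetical activities, treated as opaque functions over rows.
def to_upper(rows: List[Row]) -> List[Row]:
    return [{**r, "name": r["name"].upper()} for r in rows]


def keep_positive(rows: List[Row]) -> List[Row]:
    return [r for r in rows if r["amount"] > 0]


def total(rows: List[Row]) -> List[Row]:
    return [{"amount": sum(r["amount"] for r in rows)}]


sample = [
    {"name": "a", "amount": 1},
    {"name": "b", "amount": -2},
    {"name": "c", "amount": 3},
]

labels = {
    "to_upper": classify(to_upper, sample),
    "keep_positive": classify(keep_positive, sample),
    "total": classify(total, sample),
}
print(labels)
```

A classification of this kind is what would let a workflow designer reason about composing or reordering activities by category instead of by tool-specific operator; the sketch only inspects row counts and row membership, so a filter that happens to keep every sample row would be labeled "row-level".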