Improving OLTP data quality using data warehouse mechanisms
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Evolvable view environment (EVE): non-equivalent view maintenance under schema changes
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Maintaining data warehouses over changing information sources
Communications of the ACM
AJAX: an extensible data cleaning tool
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Conceptual modeling for ETL processes
Proceedings of the 5th ACM international workshop on Data Warehousing and OLAP
Potter's Wheel: An Interactive Data Cleaning System
Proceedings of the 27th International Conference on Very Large Data Bases
Architecture and Quality in Data Warehouses
CAiSE '98 Proceedings of the 10th International Conference on Advanced Information Systems Engineering
The COMET Metamodel for Temporal Data Warehouses
CAiSE '02 Proceedings of the 14th International Conference on Advanced Information Systems Engineering
An Extensible Framework for Data Cleaning
ICDE '00 Proceedings of the 16th International Conference on Data Engineering
Optimizing ETL Processes in Data Warehouses
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
State-Space Optimization of ETL Workflows
IEEE Transactions on Knowledge and Data Engineering
Deciding the physical implementation of ETL workflows
Proceedings of the ACM tenth international workshop on Data warehousing and OLAP
Partition-based workload scheduling in living data warehouse environments
Proceedings of the ACM tenth international workshop on Data warehousing and OLAP
Managing and querying transaction-time databases under schema evolution
Proceedings of the VLDB Endowment
Natural language reporting for ETL processes
Proceedings of the ACM 11th international workshop on Data warehousing and OLAP
QoX-driven ETL design: reducing the cost of ETL consulting engagements
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Automatic generation of ETL processes from conceptual models
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Defining ETL worfklows using BPMN and BPEL
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
pygrametl: a powerful programming framework for extract-transform-load programmers
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Cardinality estimation in ETL processes
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Generating data quality rules and integration into ETL process
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
A generic and customizable framework for the design of ETL scenarios
Information Systems - Special issue: The 15th international conference on advanced information systems engineering (CAiSE 2003)
ETL workflows: from formal specification to optimization
ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
Graph-based modeling of ETL activities with multi-level transformations and updates
DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
Policy-Regulated management of ETL evolution
Journal on Data Semantics XIII
Rule-Based management of schema changes at ETL sources
ADBIS'09 Proceedings of the 13th East European conference on Advances in Databases and Information Systems
What-if analysis for data warehouse evolution
DaWaK'07 Proceedings of the 9th international conference on Data Warehousing and Knowledge Discovery
PIKM 2011: the 4th ACM workshop for Ph.D. students in information and knowledge management
Proceedings of the 20th ACM international conference on Information and knowledge management
Hi-index | 0.00 |
External data sources (EDSs) being integrated in a data warehouse (DW) frequently change their data structures (schemas). As a consequence, in many cases, an already deployed ETL workflow executes with errors. Since structural changes of EDSs are frequent, an automatic reparation of an ETL workflow after such changes is of a high importance. In this paper we present a framework for handling the evolution of an ETL layer. To this end, structural changes are monitored and stored in a Metabase. An erroneous execution of an ETL workflow causes a reparation of the ETL activities that interact with the changed EDS, so that the repaired activities can work on the changed EDS schema. The reparation of the ETL activities is guided by several customizable reparation algorithms. The proposed framework was developed as a module external to an ETL engine, accessing the engine by means of API. The innovation of this framework are algorithms for semi-automatic reparation of an ETL workflow.