Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
XWeB: the XML warehouse benchmark
TPCTC'10 Proceedings of the Second TPC technology conference on Performance evaluation, measurement and characterization of complex systems
Efficiency evaluation of open source ETL tools
Proceedings of the 2011 ACM Symposium on Applied Computing
Optimization of analytic data flows for next generation business intelligence applications
TPCTC'11 Proceedings of the Third TPC Technology conference on Topics in Performance Evaluation, Measurement and Characterization
Towards benchmarking stream data warehouses
Proceedings of the fifteenth international workshop on Data warehousing and OLAP
What is the IQ of your data transformation system?
Proceedings of the 21st ACM international conference on Information and knowledge management
Scheduling strategies for efficient ETL execution
Information Systems
Near real-time with traditional data warehouse architectures: factors and how-to
Proceedings of the 17th International Database Engineering & Applications Symposium
Hi-index | 0.00 |
Extraction---Transform---Load (ETL) processes comprise complex data workflows, which are responsible for the maintenance of a Data Warehouse. A plethora of ETL tools is currently available constituting a multi-million dollar market. Each ETL tool uses its own technique for the design and implementation of an ETL workflow, making the task of assessing ETL tools extremely difficult. In this paper, we identify common characteristics of ETL workflows in an effort of proposing a unified evaluation method for ETL. We also identify the main points of interest in designing, implementing, and maintaining ETL workflows. Finally, we propose a principled organization of test suites based on the TPC-H schema for the problem of experimenting with ETL workflows.