As business intelligence becomes increasingly essential for organizations and evolves from strategic to operational use, the complexity of Extract-Transform-Load (ETL) processes grows. As a consequence, ETL engagements have become very time-consuming, labor-intensive, and costly. At the same time, requirements beyond functionality and performance need to be considered in the design of ETL processes. In particular, design quality is determined by an intricate combination of metrics such as reliability, maintainability, and scalability. Unfortunately, there are no methodologies, modeling languages, or tools that support ETL design in a systematic, formal way for achieving these quality requirements. Current practice handles them with ad hoc approaches based only on designers' experience. This results either in poor designs that do not meet the quality objectives or in costly engagements that require several iterations to meet them. A fundamental shift that uses automation in the ETL design task is the only way to reduce the cost of these engagements while obtaining optimal designs. Toward this goal, we present a novel approach to ETL design that incorporates a suite of quality metrics, termed QoX, at all stages of the design process. We discuss the challenges and tradeoffs among QoX metrics and illustrate their impact on alternative designs.