Principles of distributed database systems
Principles of distributed database systems
Query evaluation techniques for large databases
ACM Computing Surveys (CSUR)
Efficient resumption of interrupted warehouse loads
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
AJAX: an extensible data cleaning tool
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Query Optimization in Database Systems
ACM Computing Surveys (CSUR)
Continuous queries over data streams
ACM SIGMOD Record
Potter's Wheel: An Interactive Data Cleaning System
Proceedings of the 27th International Conference on Very Large Data Bases
A Transactional Model for Data Warehouse Maintenance
ER '02 Proceedings of the 21st International Conference on Conceptual Modeling
Lineage tracing for general data warehouse transformations
The VLDB Journal — The International Journal on Very Large Data Bases
Aurora: a new model and architecture for data stream management
The VLDB Journal — The International Journal on Very Large Data Bases
A framework for the design of ETL scenarios
CAiSE'03 Proceedings of the 15th international conference on Advanced information systems engineering
State-Space Optimization of ETL Workflows
IEEE Transactions on Knowledge and Data Engineering
Mapping conceptual to logical models for ETL processes
Proceedings of the 8th ACM international workshop on Data warehousing and OLAP
Research in data warehouse modeling and design: dead or alive?
DOLAP '06 Proceedings of the 9th ACM international workshop on Data warehousing and OLAP
One-to-many data transformations through data mappers
Data & Knowledge Engineering
Deciding the physical implementation of ETL workflows
Proceedings of the ACM tenth international workshop on Data warehousing and OLAP
Journal of Systems and Software
A method for the mapping of conceptual designs to logical blueprints for ETL processes
Decision Support Systems
Real-time data warehouse loading methodology
IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Towards generating ETL processes for incremental loading
IDEAS '08 Proceedings of the 2008 international symposium on Database engineering & applications
Workload-based optimization of integration processes
Proceedings of the 17th ACM conference on Information and knowledge management
Data integration flows for business intelligence
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
GCIP: exploiting the generation and optimization of integration processes
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
QoX-driven ETL design: reducing the cost of ETL consulting engagements
Proceedings of the 2009 ACM SIGMOD International Conference on Management of data
Cost-Based Vectorization of Instance-Based Integration Processes
ADBIS '09 Proceedings of the 13th East European Conference on Advances in Databases and Information Systems
Optimizing data warehouse loading procedures for enabling useful-time data warehousing
IDEAS '09 Proceedings of the 2009 International Database Engineering & Applications Symposium
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Defining ETL worfklows using BPMN and BPEL
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
Cardinality estimation in ETL processes
Proceedings of the ACM twelfth international workshop on Data warehousing and OLAP
State driven semantic modeling of operators in ETL workflow
Journal of Computing Sciences in Colleges
Callisto: mergers without pain
BIRTE'06 Proceedings of the 1st international conference on Business intelligence for the real-time enterprises
ETL workflows: from formal specification to optimization
ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
Scalable performance of system S for extract-transform-load processing
Proceedings of the 3rd Annual Haifa Experimental Systems Conference
Optimized incremental ETL jobs for maintaining data warehouses
Proceedings of the Fourteenth International Database Engineering & Applications Symposium
Cost-based vectorization of instance-based integration processes
Information Systems
MapMerge: correlating independent schema mappings
Proceedings of the VLDB Endowment
Leveraging business process models for ETL design
ER'10 Proceedings of the 29th international conference on Conceptual modeling
Real-time data warehousing for business intelligence
Proceedings of the 8th International Conference on Frontiers of Information Technology
Efficiency evaluation of open source ETL tools
Proceedings of the 2011 ACM Symposium on Applied Computing
Tracing data errors with view-conditioned causality
Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
A semantic approach to ETL technologies
Data & Knowledge Engineering
TTL: a transformation, transference and loading approach for active monitoring
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
DWEVOLVE: a requirement based framework for data warehouse evolution
ACM SIGSOFT Software Engineering Notes
E-ETL: framework for managing evolving etl processes
Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management
Workflow based security incident management
PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
Data mapper: an operator for expressing one-to-many data transformations
DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
Data cleaning and transformation using the AJAX framework
GTTSE'05 Proceedings of the 2005 international conference on Generative and Transformational Techniques in Software Engineering
MapMerge: correlating independent schema mappings
The VLDB Journal — The International Journal on Very Large Data Bases
Optimizing analytic data flows for multiple execution engines
SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Stubby: a transformation-based optimizer for MapReduce workflows
Proceedings of the VLDB Endowment
Optimization of analytic data flows for next generation business intelligence applications
TPCTC'11 Proceedings of the Third TPC Technology conference on Topics in Performance Evaluation, Measurement and Characterization
ACM SIGSOFT Software Engineering Notes
Integrating ETL processes from information requirements
DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
Object Migration Tool for Data Warehouses
International Journal of Strategic Information Technology and Applications
Hybrid Analytic Flows-the Case for Optimization
Fundamenta Informaticae - Scalable Workflow Enactment Engines and Technology
Hi-index | 0.00 |
Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization and insertion into a data warehouse. Usually, these processes must be completed in a certain time window; thus, it is necessary to optimize their execution time. In this paper, we delve into the logical optimization of ETL processes, modeling it as a state-space search problem. We consider each ETL workflow as a state and fabricate the state space through a set of correct state transitions. Moreover, we provide algorithms towards the minimization of the execution cost of an ETL workflow.