Principles of distributed database systems
Principles of distributed database systems
Query evaluation techniques for large databases
ACM Computing Surveys (CSUR)
Schema equivalence in heterogeneous systems: bridging theory and practice
Information Systems - Special issue on extending database technology
Efficient resumption of interrupted warehouse loads
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
AJAX: an extensible data cleaning tool
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
Query Optimization in Database Systems
ACM Computing Surveys (CSUR)
Fundamentals of Database Systems
Fundamentals of Database Systems
Continuous queries over data streams
ACM SIGMOD Record
Potter's Wheel: An Interactive Data Cleaning System
Proceedings of the 27th International Conference on Very Large Data Bases
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
A Transactional Model for Data Warehouse Maintenance
ER '02 Proceedings of the 21st International Conference on Conceptual Modeling
A Model Theory for Generic Schema Management
DBPL '01 Revised Papers from the 8th International Workshop on Database Programming Languages
Lineage tracing for general data warehouse transformations
The VLDB Journal — The International Journal on Very Large Data Bases
Aurora: a new model and architecture for data stream management
The VLDB Journal — The International Journal on Very Large Data Bases
Optimizing ETL Processes in Data Warehouses
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Mapping adaptation under evolving schemas
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
A framework for the design of ETL scenarios
CAiSE'03 Proceedings of the 15th international conference on Advanced information systems engineering
Designing ETL processes using semantic web technologies
DOLAP '06 Proceedings of the 9th ACM international workshop on Data warehousing and OLAP
Deciding the physical implementation of ETL workflows
Proceedings of the ACM tenth international workshop on Data warehousing and OLAP
Data description and data access mechanism in distributed workflow system
Proceedings of the 2nd international conference on Scalable information systems
Automating the loading of business process data warehouses
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
ETL Workflow Analysis and Verification Using Backwards Constraint Propagation
CAiSE '09 Proceedings of the 21st International Conference on Advanced Information Systems Engineering
Measures for ETL processes models in data warehouses
Proceedings of the first international workshop on Model driven service engineering and data quality and security
Representation of conceptual ETL designs in natural language using Semantic Web technology
Data & Knowledge Engineering
ETL workflows: from formal specification to optimization
ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
Information and Software Technology
A framework for OLAP content personalization
ADBIS'10 Proceedings of the 14th east European conference on Advances in databases and information systems
Proceedings of the International Conference & Workshop on Emerging Trends in Technology
A quest for beauty and wealth (or, business processes for database researchers)
Proceedings of the thirtieth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
A new approach to performance optimization of mashups via data flow refactoring
Proceedings of the Second Asia-Pacific Symposium on Internetware
E-ETL: framework for managing evolving etl processes
Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management
Workflow based security incident management
PCI'05 Proceedings of the 10th Panhellenic conference on Advances in Informatics
Workflow clustering method based on process similarity
ICCSA'06 Proceedings of the 2006 international conference on Computational Science and Its Applications - Volume Part II
Ontology-Driven conceptual design of ETL processes using graph transformations
Journal on Data Semantics XIII
Scheduling strategies for efficient ETL execution
Information Systems
Hi-index | 0.01 |
Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization, and insertion into a data warehouse. In this paper, we delve into the logical optimization of ETL processes, modeling it as a state-space search problem. We consider each ETL workflow as a state and fabricate the state space through a set of correct state transitions. Moreover, we provide an exhaustive and two heuristic algorithms toward the minimization of the execution cost of an ETL workflow. The heuristic algorithm with greedy characteristics significantly outperforms the other two algorithms for a large set of experimental cases.