An overview of data warehousing and OLAP technology
ACM SIGMOD Record
Arktos: towards the modeling, design, control and execution of ETL processes
Information Systems - Data extraction, cleaning and reconciliation
The Data Warehouse Lifecycle Toolkit: Expert Methods for Designing, Developing and Deploying Data Warehouses with CD Rom
Data Warehousing for E-Business
Data Warehousing for E-Business
Capturing Delays and Valid Times in Data Warehouses—Towards Timely Consistent Analyses
Journal of Intelligent Information Systems - Special issue on data warehousing and knowledge discovery
Continuous queries over data streams
ACM SIGMOD Record
SAP Business Information Warehouse - From Data Warehousing to an E-business Platform
Proceedings of the 17th International Conference on Data Engineering
Incremental Computation and Maintenance of Temporal Aggregates
Proceedings of the 17th International Conference on Data Engineering
Performance Issues in Incremental Warehouse Maintenance
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Striving towards Near Real-Time Data Integration for Data Warehouses
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
Towards an Accommodation of Delay in Temporal Active Databases
ADC '00 Proceedings of the Australasian Database Conference
Temporal data warehousing
Aurora: a new model and architecture for data stream management
The VLDB Journal — The International Journal on Very Large Data Bases
Optimizing ETL Processes in Data Warehouses
ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Flexible time management in data stream systems
PODS '04 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
ETL queues for active data warehousing
Proceedings of the 2nd international workshop on Information quality in information systems
A generic and customizable framework for the design of ETL scenarios
Information Systems - Special issue: The 15th international conference on advanced information systems engineering (CAiSE 2003)
The zero-delay data warehouse: mobilizing heterogeneous database
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
A method for the mapping of conceptual designs to logical blueprints for ETL processes
Decision Support Systems
Meshing Streaming Updates with Persistent Data in an Active Data Warehouse
IEEE Transactions on Knowledge and Data Engineering
Towards a streaming SQL standard
Proceedings of the VLDB Endowment
An Enhanced Extract-Transform-Load System for Migrating Data in Telecom Billing
ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
The Data Warehouse Lifecycle Toolkit
The Data Warehouse Lifecycle Toolkit
Parallel Real-Time OLAP on Multi-core Processors
CCGRID '12 Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012)
Predictive analytics with surveillance big data
Proceedings of the 1st ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data
Near real-time with traditional data warehouse architectures: factors and how-to
Proceedings of the 17th International Database Engineering & Applications Symposium
Hi-index | 0.00 |
The purpose of a data warehouse is to aid decision making. As the real-time enterprise evolves, synchronism between transactional data and data warehouses is redefined. To cope with real-time requirements, the data warehouses must be able to enable continuous data integration, in order to deal with the most recent business data. Traditional data warehouses are unable to support any dynamics in structure and content while they are available for OLAP. Their data is periodically updated because they are unprepared for continuous data integration. For real-time enterprises with needs in decision support while the transactions are occurring, (near) real-time data warehousing seem very promising. In this paper we present a survey on testing today's most used loading techniques and analyze which are the best data loading methods, presenting a methodology for efficiently supporting continuous data integration for data warehouses. To accomplish this, we use techniques such as table structure replication with minimum content and query predicate restrictions for selecting data, to enable loading data in the data warehouse continuously, with minimum impact in query execution time. We demonstrate the efficiency of the method using benchmark TPC-H and executing query workloads while simultaneously performing continuous data integration.