A Transactional Approach to Parallel Data Warehouse Maintenance

  • Authors:
  • Bin Liu;Songting Chen;Elke A. Rundensteiner

  • Affiliations:
  • -;-;-

  • Venue:
  • DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data Warehousing is becoming an increasingly important technology for information integration and data analysis. Given the dynamic nature of modern distributed environments, both source data and schema changes are likely to occur autonomously and even concurrently in different sources. We have thus developed a comprehensive solution approach, called TxnWrap, that successfully maintains the warehouse views under any type of concurrent source updates. In this work, we now overcome TxnWrap's restriction that the maintenance is processed one by one for each source update, since that limits the performance. To overcome this limitation, we exploit the transactional approach of TxnWrap to achieve parallel data warehouse maintenance. For this, we first identify the read/write conflicts among the different warehouse maintenance processes. We then propose a parallel maintenance scheduler (PMS) that generates legal schedules that resolve these conflicts. PMS has been implemented and incorporated into our TxnWrap system. The experimental results confirm that our parallel maintenance scheduler significantly improves the performance of data warehouse maintenance.