SIS—A schema integration system
on Proceedings of the Fifth British National Conference on Databases (BNCOD 5)
A comparative analysis of methodologies for database schema integration
ACM Computing Surveys (CSUR)
Federated database systems for managing distributed, heterogeneous, and autonomous databases
ACM Computing Surveys (CSUR) - Special issue on heterogeneous databases
The breakdown of the information model in multi-database systems
ACM SIGMOD Record
Data manipulation in heterogeneous databases
ACM SIGMOD Record
Fundamentals of database systems (2nd ed.)
Fundamentals of database systems (2nd ed.)
Using the data warehouse
An algebraic transformation framework for multidatabase queries
Distributed and Parallel Databases
Migrating legacy systems: gateways, interfaces & the incremental approach
Migrating legacy systems: gateways, interfaces & the incremental approach
Research problems in data warehousing
CIKM '95 Proceedings of the fourth international conference on Information and knowledge management
The merge/purge problem for large databases
SIGMOD '95 Proceedings of the 1995 ACM SIGMOD international conference on Management of data
Implementing data cubes efficiently
SIGMOD '96 Proceedings of the 1996 ACM SIGMOD international conference on Management of data
Building the Data Warehouse
Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals
Data Mining and Knowledge Discovery
IEEE Transactions on Knowledge and Data Engineering
An Evidential Reasoning Approach to Attribute Value Conflict Resolution in Database Integration
IEEE Transactions on Knowledge and Data Engineering
The Inter-Database Instance Identification Problem in Integrating Autonomous Systems
Proceedings of the Fifth International Conference on Data Engineering
Multi-User View Integration System (MUVIS): An Expert System for View Integration
Proceedings of the Sixth International Conference on Data Engineering
Entity Identification in Database Integration
Proceedings of the Ninth International Conference on Data Engineering
Aggregate-Query Processing in Data Warehousing Environments
VLDB '95 Proceedings of the 21th International Conference on Very Large Data Bases
Processing Queries Over Generalization Hierarchies in a Multidatabase System
VLDB '83 Proceedings of the 9th International Conference on Very Large Data Bases
Data warehousing: warehouse administration
Handbook of data mining and knowledge discovery
A multidimensional data warehouse development methodology
Managing data mining technologies in organizations
A methodology for datawarehouse design: conceptual modeling
Data warehousing and web engineering
The Catch data warehouse: support for community health care decision-making
Decision Support Systems
Integrated decision support systems: A data warehousing perspective
Decision Support Systems
Hi-index | 0.00 |
Data warehousing is gaining in popularity as organizations realize the benefits of being able to perform sophisticated analyses of their data. Recent years have seen the introduction of a number of data-warehousing engines, from both established database vendors as well as new players. The engines themselves are relatively easy to use and come with a good set of end-user tools. However, there is one key stumbling block to the rapid development of data warehouses, namely that of warehouse population. Specifically, problems arise in populating a warehouse with existing data since it has various types of heterogeneity. Given the lack of good tools, this task has generally been performed by various system integrators, e.g., software consulting organizations which have developed in-house tools and processes for the task. The general conclusion is that the task has proven to be labor-intensive, error-prone, and generally frustrating, leading a number of warehousing projects to be abandoned mid-way through development. However, the picture is not as grim as it appears. The problems that are being encountered in warehouse creation are very similar to those encountered in data integration, and they have been studied for about two decades. However, not all problems relevant to warehouse creation have been solved, and a number of research issues remain. The principal goal of this paper is to identify the common issues in data integration and data-warehouse creation. We hope this will lead: 1) developers of warehouse creation tools to examine and, where appropriate, incorporate the techniques developed for data integration, and 2) researchers in both the data integration and the data warehousing communities to address the open research issues in this important area.