While the main goal of a data warehouse is to support data analysis and management decisions, a fundamental aspect of designing a data warehouse system is the process of acquiring the raw data from a set of relevant information sources. We call the component of a data warehouse system that deals with this process the source integration system. Its main goal is to transfer data from the set of sources constituting the application-oriented operational environment to the data warehouse. Since sources are typically autonomous, distributed, and heterogeneous, this task involves cleaning, reconciling, and integrating the data coming from the sources. The design of a source integration system is a very complex task that comprises several different issues. The purpose of this chapter is to discuss the most important problems arising in the design of a source integration system, with special emphasis on schema integration, query processing for data integration, and data cleaning and reconciliation.