Integrating schemas of heterogeneous relational databases through schema matching

Authors:
Yaser Karasneh;Hamidah Ibrahim;Mohamed Othman;Razali Yaakob
Affiliations:
Universiti Putra Malaysia, Selangor D. E., Malaysia;Universiti Putra Malaysia, Selangor D. E., Malaysia;Universiti Putra Malaysia, Selangor D. E., Malaysia;Universiti Putra Malaysia, Selangor D. E., Malaysia
Venue:
Proceedings of the 11th International Conference on Information Integration and Web-based Applications & Services
Year:
2009

Abstract

Database integration aims to provide a uniform and consistent view, called a global schema, over a set of autonomous and heterogeneous data sources, so that data residing in different sources can be accessed as if they were in a single schema. The integration of database schemas can be performed in four main phases: preintegration, comparison of the schemas, conforming of the schemas, and merging and restructuring. The second and third phases can be combined and are normally known as schema matching. Schema matching, the focus of this paper, is a fundamental operation in the manipulation of schema information: it takes two schemas and identifies the elements that correspond semantically to each other. Manually specifying schema matches is a tedious, time-consuming, error-prone, and therefore expensive process, which is a growing problem given the rapidly increasing number of data sources to integrate. As systems become able to handle more complex databases and applications, their schemas become larger, further increasing the number of matches to be performed. Thus, automating this process to make it faster and less labor-intensive has been one of the main tasks in data integration. However, it is not possible to determine the different correspondences between schemas fully automatically, primarily because the semantics of the schemas differ and are often not made explicit or documented. Several solutions to the issues of schema matching have been proposed. Nevertheless, these solutions are still limited, as they do not explore most of the available information related to schemas, which affects the result of integration. This paper presents an approach for integrating the schemas of heterogeneous relational databases through schema matching that utilizes most of the information related to schemas, thereby indirectly exploring the implicit semantics of the schemas, which further improves the results of the integration.
This paper also shows that the produced integrated schemas (global schemas) maintain the properties of the initial input schemas as well as the characteristics of the relational model.