Anatomy of data integration

  • Authors:
  • Olga Brazhnik;John F. Jones

  • Affiliations:
  • Center for Information Technology, National Institutes of Health, 10401 Fernwood Road, Room 3NW03, Bethesda, MD 20817, USA;Center for Information Technology, National Institutes of Health, 10401 Fernwood Road, Room 3NW03, Bethesda, MD 20817, USA

  • Venue:
  • Journal of Biomedical Informatics
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Producing reliable information is the ultimate goal of data processing. The ocean of data created with the advances of science and technologies calls for integration of data coming from heterogeneous sources that are diverse in their purposes, business rules, underlying models and enabling technologies. Reference models, Semantic Web, standards, ontology, and other technologies enable fast and efficient merging of heterogeneous data, while the reliability of produced information is largely defined by how well the data represent the reality. In this paper, we initiate a framework for assessing the informational value of data that includes data dimensions; aligning data quality with business practices; identifying authoritative sources and integration keys; merging models; uniting updates of varying frequency and overlapping or gapped data sets.