Schema AND Data: A Holistic Approach to Mapping, Resolution and Fusion in Information Integration

  • Authors:
  • Laura M. Haas;Martin Hentschel;Donald Kossmann;Renée J. Miller

  • Affiliations:
  • IBM Almaden Research Center, San Jose, USA 95120;Systems Group, ETH Zurich, Switzerland;Systems Group, ETH Zurich, Switzerland;Department of Computer Science, University of Toronto, Canada

  • Venue:
  • ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

To integrate information, data in different formats, from dif- ferent, potentially overlapping sources, must be related and transformed to meet the users' needs. Ten years ago, Clio introduced nonprocedural schema mappings to describe the relationship between data in heteroge- neous schemas. This enabled powerful tools for mapping discovery and integration code generation, greatly simplifying the integration process. However, further progress is needed. We see an opportunity to raise the level of abstraction further, to encompass both data- and schema-centric integration tasks and to isolate applications from the details of how the integration is accomplished. Holistic information integration supports it- eration across the various integration tasks, leveraging information about both schema and data to improve the integrated result. Integration inde- pendence allows applications to be independent of how, when, and where information integration takes place, making materialization and the tim- ing of transformations an optimization decision that is transparent to applications. In this paper, we define these two important goals, and propose leveraging data mappings to create a framework that supports both data- and schema-level integration tasks.