Automatic multi-schema integration based on user preference

  • Authors:
  • Guohui Ding;Guoren Wang;Junchang Xin;Huichao Geng

  • Affiliations:
  • Key Laboratory of Medical Image Computing, Ministry of Education and College of Information Science & Engineering, Northeastern University, China;Key Laboratory of Medical Image Computing, Ministry of Education and College of Information Science & Engineering, Northeastern University, China;Key Laboratory of Medical Image Computing, Ministry of Education and College of Information Science & Engineering, Northeastern University, China;College of Information Science & Engineering, Northeastern University, China

  • Venue:
  • WAIM'10 Proceedings of the 11th International Conference on Web-Age Information Management
  • Year:
  • 2010

Abstract

Schema integration plays a central role in numerous database applications, such as the Deep Web, dataspaces, and ontology merging. Although schema integration has been studied extensively, existing work neglects user preference, which is an important factor in improving the quality of mediated schemas. In this paper, we propose automatic multi-schema integration based on user preference. We introduce a new concept, the reference schema, to represent user preference; it guides the integration process so that the generated mediated schemas conform to that preference. Unlike previous solutions, our approach employs F-measure and "attribute density" to measure the similarity between schemas. Based on this similarity, we design a top-k ranking algorithm that retrieves the k mediated schemas that best match what users expect. The key component of the algorithm is a pruning strategy that uses divide and conquer to narrow the search space of candidate schemas. Finally, an experimental study demonstrates the effectiveness and good performance of our approach.
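
To make the ranking idea concrete, the following is a minimal sketch and not the authors' algorithm. It assumes that a schema can be modeled as a flat set of attribute names and that F-measure is computed from attribute overlap with the reference schema. The function names `f_measure` and `top_k_schemas` are hypothetical. "Attribute density" and the divide-and-conquer pruning strategy are not defined in the abstract, so they are left out; the sketch scores every candidate exhaustively.

```python
import heapq


def f_measure(candidate, reference):
    """F-measure of a candidate schema against a reference schema.

    Assumption: both schemas are plain collections of attribute names,
    and a match means exact name equality.
    """
    candidate, reference = set(candidate), set(reference)
    overlap = len(candidate & reference)
    if overlap == 0:
        return 0.0
    precision = overlap / len(candidate)  # share of candidate attributes that are wanted
    recall = overlap / len(reference)     # share of wanted attributes that are covered
    return 2 * precision * recall / (precision + recall)


def top_k_schemas(candidates, reference, k):
    """Return the k candidates most similar to the reference schema.

    Exhaustive scoring only; no pruning of the candidate space is attempted.
    """
    return heapq.nlargest(k, candidates, key=lambda c: f_measure(c, reference))


# Hypothetical example data.
reference = {"title", "author", "year"}
candidates = [
    ("title", "author", "year"),
    ("title", "price"),
    ("isbn",),
]
best = top_k_schemas(candidates, reference, k=2)
```

Here `best` holds the two candidates that share the most attributes with the reference schema, ordered by decreasing score. A realistic implementation would need attribute matching that goes beyond exact name equality.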