Learning to integrate web catalogs with conceptual relationships in hierarchical thesaurus

  • Authors:
  • Jui-Chi Ho;Ing-Xiang Chen;Cheng-Zen Yang

  • Affiliations:
  • Department of Computer Science and Engineering, Yuan Ze University, Taiwan, R.O.C.;Department of Computer Science and Engineering, Yuan Ze University, Taiwan, R.O.C.;Department of Computer Science and Engineering, Yuan Ze University, Taiwan, R.O.C.

  • Venue:
  • AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Web catalog integration has been addressed as an important issue in current digital content management. Past studies have shown that exploiting a flattened structure with auxiliary information extracted from the source catalog can improve the integration results. Although earlier studies have also shown that exploiting a hierarchical structure in classification may bring better advantages, the effectiveness has not been testified in catalog integration. In this paper, we propose an enhanced catalog integration (ECI) approach to extract the conceptual relationships from the hierarchical Web thesaurus and further improve the accuracy of Web catalog integration. We have conducted experiments of real-world catalog integration with both a flattened structure and a hierarchical structure in the destination catalog. The results show that our ECI scheme effectively boosts the integration accuracy of both the flattened scheme and the hierarchical scheme with the advanced Support Vector Machine (SVM) classifiers.