Building Domain Ontology Based on Web Data and Generic Ontology

  • Authors:
  • Jie Yang;Lei Wang;Song Zhang;Xin Sui;Ning Zhang;Zhuoqun Xu

  • Affiliations:
  • Peking Univ., Beijing, China;Peking Univ., Beijing, China;Peking Univ., Beijing, China;Peking Univ., Beijing, China;Peking Univ., Beijing, China;Peking Univ., Beijing, China

  • Venue:
  • WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
  • Year:
  • 2004

Quantified Score

Hi-index 0.01

Visualization

Abstract

The automatic or semi-automatic construction of ontology has become a research topic of interest in recent years. This paper describes a mechanism for constructing domain specific ontologies automatically based on web data and generic ontology.Firstly, we employ the hierarchical agglomerative clustering(HAC) algorithm, clustering web pages hierarchically and resulting in a binary tree.Then an algorithm is proposed, which selectd from the binary tree the significative nodes as topics implying concepts of domain interests.Lastly, the Chinese generic ontology, HowNet, is introduced to evolve the topics (together with their hierarchical structures) into domain ontology.We experiment our method in the field of computer hardware based on web pages collected from Chinese BtoC web sites.An in-depth discussion on the experiment results is also given.