AUTOMATIC DOMAIN ONTOLOGY GENERATION FROM WEB SITES

  • Authors:
  • Tak-Lam Wong;Wai Lam;Enhong Chen

  • Affiliations:
  • Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, Hong Kong;Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Shatin, Hong Kong;Department of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui, 230027, P.R. China

  • Venue:
  • Journal of Integrated Design & Process Science
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Ontology plays an important role in semantic Web technology since it can effectively represent the domain knowledge. We develop a novel framework for automatically generating the domain knowledge by analyzing different Web sites in a given domain. The idea of our approach is to consider two kinds of information from the Web sites. The first kind of information is the text fragments corresponding to the concepts in the ontology. The other kind of information is the header labels corresponding to the concepts. We design a method for generating the domain ontology by measuring the similarity between the concepts in different Web sites. We have conducted extensive experiments to demonstrate the effectiveness of our approach.