Automatic web pages hierarchical classification using dynamic domain ontologies

  • Authors:
  • Antonio M. Rinaldi

  • Affiliations:
  • Dipartimento di Informatica e Sistemistica, Università di Napoli Federico II, 80125 Via Claudio, 21, Napoli, Italy

  • Venue:
  • International Journal of Knowledge and Web Intelligence
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The use of ontologies for knowledge representation has had a fast increase in the last years and they are used in several application context. One of these challenging applications is the web. Managing large amount of information on internet needs more efficient and effective methods and techniques for mining and representing information. In this article, we present a methodology for automatic topic annotation of web pages. We describe an algorithm for words disambiguation using an apposite metric for measuring the semantic relatedness and we show a technique which allows to detect the topic of the analysed document using ontologies extracted from a knowledge base. The strategy is implemented in a system where these information are used to build a topic hierarchy automatically created and not a priori defined for classifying web pages. Experimental results are presented and discussed in order to measure the effectiveness of our approach.