Automatic Classification of Web Information Based on Site Structure

  • Authors:
  • Gao Kening;Yang Leiming;Zhang Bin;Chai Qiaozi;Ma Anxiang

  • Affiliations:
  • Northeastern University , Shenyang, China;Northeastern University , Shenyang, China;Northeastern University , Shenyang, China;Northeastern University , Shenyang, China;Northeastern University , Shenyang, China

  • Venue:
  • CW '05 Proceedings of the 2005 International Conference on Cyberworlds
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

How to classify automatically web information that grows explosive is becoming an imminent problem needed to be resolved. Based on site structure, we propose, in this paper, a new mechanism of automatic classification of web information, which downloads web pages within a web site, records the hyperlinks among web pages, catches the site structure, extracts the classifying system of the site itself, and then links categorizing information with the correspondent position in the site structure. Therefore automatic classification of web information can be realized through matching the positions of categorizing information with the positions of web pages. Experiments show that such classification based on site structure works more accurately and efficiently.