A Voting Method for the Classification of Web Pages

  • Authors:
  • Rui Fang;Alexander Mikroyannidis;Babis Theodoulidis

  • Affiliations:
  • University of Manchester, United Kingdom;University of Manchester, United Kingdom;University of Manchester, United Kingdom

  • Venue:
  • WI-IATW '06 Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper discusses web page classification using hypertext features such as the text included in the web page, the title, headings, URL, and anchor text. Five different classification approaches based on SVM that use individual features or combinations are investigated on the LookSmart dataset. The initial experimental results have shown that combining the features improves the performance of the classifier and that some features such as title and headings can be very useful for certain tasks. On the basis of this analysis, we propose a voting method that further improves the performance compared with the individual classifiers.