Binary Cybergenre Classification Using Theoretic Feature Measures

  • Authors:
  • Lei Dong;Carolyn Watters;Jack Duffy;Michael Shepherd

  • Affiliations:
  • Dalhousie University, Canada;Dalhousie University, Canada;Dalhousie University, Canada;Dalhousie University, Canada

  • Venue:
  • WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
  • Year:
  • 2006

Abstract

In this study, we investigate automatic genre classification for three common types of web pages. In a set of controlled experiments, we examine how three theoretic feature selection measures, a range of feature set sizes, and three machine classifiers affect the accuracy of web page classification. Our results are encouraging: for binary classification tasks, at least for these web page genres, satisfactory results can be reached with small content-based feature sets generated with a sound feature selection measure. Furthermore, we found no evidence of interaction between these feature selection measures and the machine classifiers used.