Rough set Based Ensemble Classifier forWeb Page Classification

  • Authors:
  • Suman Saha;C.A. Murthy;Sankar K. Pal

  • Affiliations:
  • Center for Soft Computing Research, Indian Statistical Institute, India. E-mail: {ssaha_r,murthy,sankar}@isical.ac.in;Center for Soft Computing Research, Indian Statistical Institute, India. E-mail: {ssaha_r,murthy,sankar}@isical.ac.in;Center for Soft Computing Research, Indian Statistical Institute, India. E-mail: {ssaha_r,murthy,sankar}@isical.ac.in

  • Venue:
  • Fundamenta Informaticae
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Combining the results of a number of individually trained classification systems to obtain a more accurate classifier is a widely used technique in pattern recognition. In this article, we have introduced a rough set based meta classifier to classify web pages. The proposed method consists of two parts. In the first part, the output of every individual classifier is considered for constructing a decision table. In the second part, rough set attribute reduction and rule generation processes are used on the decision table to construct a meta classifier. It has been shown that (1) the performance of the meta classifier is better than the performance of every constituent classifier and, (2) the meta classifier is optimal with respect to a quality measure defined in the article. Experimental studies show that the meta classifier improves accuracy of classification uniformly over some benchmark corpora and beats other ensemble approaches in accuracy by a decisive margin, thus demonstrating the theoretical results. Apart from this, it reduces the CPU load compared to other ensemble classification techniques by removing redundant classifiers from the combination.