A genetic algorithm-based rule extraction system

  • Authors:
  • Bikash Kanti Sarkar;Shib Sankar Sana;Kripasindhu Chaudhuri

  • Affiliations:
  • Department of Information Technology, B.I.T., Mesra, Ranchi 835 215, Jharkhand, India;Department of Mathematics, Bhangar Mahavidyalaya(C.U.), Bhangar 743 502 24-Pgs(S), W.B., India;Department of Mathematics, Jadavpur University, Kolkata 32, India

  • Venue:
  • Applied Soft Computing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Individual classifiers predict unknown objects. Although, these are usually domain specific, and lack the property of scaling up prediction while handling data sets with huge size and high-dimensionality or imbalance class distribution. This article introduces an accuracy-based learning system called DTGA (decision tree and genetic algorithm) that aims to improve prediction accuracy over any classification problem irrespective to domain, size, dimensionality and class distribution. More specifically, the proposed system consists of two rule inducing phases. In the first phase, a base classifier, C4.5 (a decision tree based rule inducer) is used to produce rules from training data set, whereas GA (genetic algorithm) in the next phase refines them with the aim to provide more accurate and high-performance rules for prediction. The system has been compared with competent non-GA based systems: neural network, Naive Bayes, rule-based classifier using rough set theory and C4.5 (i.e., the base classifier of DTGA), on a number of benchmark datasets collected from UCI (University of California at Irvine) machine learning repository. Empirical results demonstrate that the proposed hybrid approach provides marked improvement in a number of cases.