A decision rule-based method for feature selection in predictive data mining

  • Authors:
  • Patricia E. N. Lutu;Andries P. Engelbrecht

  • Affiliations:
  • Department of Informatics, University of Pretoria, South Africa;Department of Computer Science, University of Pretoria, South Africa

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2010

Quantified Score

Hi-index 12.05

Visualization

Abstract

Algorithms for feature selection in predictive data mining for classification problems attempt to select those features that are relevant, and are not redundant for the classification task. A relevant feature is defined as one which is highly correlated with the target function. One problem with the definition of feature relevance is that there is no universally accepted definition of what it means for a feature to be 'highly correlated with the target function or highly correlated with the other features'. A new feature selection algorithm which incorporates domain specific definitions of high, medium and low correlations is proposed in this paper. The proposed algorithm conducts a heuristic search for the most relevant features for the prediction task.