Rough set and scatter search metaheuristic based feature selection for credit scoring

  • Authors:
  • Jue Wang;Abdel-Rahman Hedar;Shouyang Wang;Jian Ma

  • Affiliations:
  • Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, PR China;Department of Computer Science, Faculty of Computers and Information, Assiut University, Egypt;Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, PR China;Department of Information Systems, City University of Hong Kong, Hong Kong

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.05

Visualization

Abstract

As the credit industry has been growing rapidly, credit scoring models have been widely used by the financial industry during this time to improve cash flow and credit collections. However, a large amount of redundant information and features are involved in the credit dataset, which leads to lower accuracy and higher complexity of the credit scoring model. So, effective feature selection methods are necessary for credit dataset with huge number of features. In this paper, a novel approach, called RSFS, to feature selection based on rough set and scatter search is proposed. In RSFS, conditional entropy is regarded as the heuristic to search the optimal solutions. Two credit datasets in UCI database are selected to demonstrate the competitive performance of RSFS consisted in three credit models including neural network model, J48 decision tree and Logistic regression. The experimental result shows that RSFS has a superior performance in saving the computational costs and improving classification accuracy compared with the base classification methods.