Feature selection in bankruptcy prediction

  • Authors:
  • Chih-Fong Tsai

  • Affiliations:
  • Department of Information Management, National Central University, 300 Jhongda Road, Jhongli 32001, Taiwan

  • Venue:
  • Knowledge-Based Systems
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

For many corporations, assessing the credit of investment targets and the possibility of bankruptcy is a vital issue before investment. Data mining and machine learning techniques have been applied to solve the bankruptcy prediction and credit scoring problems. As feature selection is an important step to select more representative data from a given dataset in data mining to improve the final prediction performance, it is unknown that which feature selection method is better. Therefore, this paper aims at comparing five well-known feature selection methods used in bankruptcy prediction, which are t-test, correlation matrix, stepwise regression, principle component analysis (PCA) and factor analysis (FA) to examine their prediction performance. Multi-layer perceptron (MLP) neural networks are used as the prediction model. Five related datasets are used in order to provide a reliable conclusion. Regarding the experimental results, the t-test feature selection method outperforms the other ones by the two performance measurements.