Hybrid system with genetic algorithm and artificial neural networks and its application to retail credit risk assessment

Authors:
Stjepan Oreski;Dijana Oreski;Goran Oreski
Affiliations:
Bank of Karlovac, I.G.Kovacica 1, 47000 Karlovac, Croatia;Faculty of Organization and Informatics, University of Zagreb, Pavlinska 2, 42000 Varadin, Croatia;Bank of Karlovac, I.G.Kovacica 1, 47000 Karlovac, Croatia
Venue:
Expert Systems with Applications: An International Journal
Year:
2012

Citing 9
Cited 3

Wrappers for feature subset selection

Artificial Intelligence - Special issue on relevance
Data Mining: Concepts and Techniques

Data Mining: Concepts and Techniques
Credit scoring with a data mining approach based on support vector machines

Expert Systems with Applications: An International Journal
Consumer credit scoring models with limited data

Expert Systems with Applications: An International Journal
Comparison procedure of predicting the time to default in behavioural scoring

Expert Systems with Applications: An International Journal
The consumer loan default predicting model - An application of DEA-DA and neural network

Expert Systems with Applications: An International Journal
Multiple classifier application to credit risk assessment

Expert Systems with Applications: An International Journal
The evaluation of consumer loans using support vector machines

Expert Systems with Applications: An International Journal
Neural networks for credit risk evaluation: Investigation of different neural models and learning schemes

Expert Systems with Applications: An International Journal

The bank loan approval decision from multiple perspectives

Expert Systems with Applications: An International Journal
Hybrid genetic algorithms for stress recognition in reading

EvoBIO'13 Proceedings of the 11th European conference on Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics
Genetic algorithm-based heuristic for feature selection in credit risk assessment

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	12.05

Visualization

Abstract

The databases of the banks around the world have accumulated large quantities of information about clients and their financial and payment history. These databases can be used for the credit risk assessment, but they are commonly high dimensional. Irrelevant features in a training dataset may produce less accurate results of classification analysis. Data preprocessing is required to prepare the data for classification to increase the predictive accuracy. Feature selection is a preprocessing technique commonly used on high dimensional data and its purposes include reducing dimensionality, removing irrelevant and redundant features, facilitating data understanding, reducing the amount of data needed for learning, improving predictive accuracy of algorithms, and increasing interpretability of models. In this paper we investigate the extent to which the total data, owned by a bank, can be a good basis for predicting the borrower's ability to repay the loan on time. We propose a feature selection technique for finding an optimum feature subset that enhances the classification accuracy of neural network classifiers. Experiments were conducted on the credit dataset collected at a Croatian bank to assess the accuracy of our technique. We found that the hybrid system with genetic algorithm is competitive and can be used as feature selection technique to discover the most significant features in determining risk of default.