Improving clustering analysis for credit card accounts classification

Authors:
Yi Peng;Gang Kou;Yong Shi;Zhengxin Chen
Affiliations:
College of Information Science & Technology, University of Nebraska at Omaha, Omaha, NE;College of Information Science & Technology, University of Nebraska at Omaha, Omaha, NE;College of Information Science & Technology, University of Nebraska at Omaha, Omaha, NE;College of Information Science & Technology, University of Nebraska at Omaha, Omaha, NE
Venue:
ICCS'05 Proceedings of the 5th international conference on Computational Science - Volume Part III
Year:
2005

Citing 1
Cited 3

Data mining: concepts and techniques

Data mining: concepts and techniques

A Multi-criteria Convex Quadratic Programming model for credit data analysis

Decision Support Systems
Knowledge-Rich Data Mining in Financial Risk Detection

ICCS 2009 Proceedings of the 9th International Conference on Computational Science
Personal bankruptcy prediction by mining credit card data

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

In credit card portfolio management, predicting the cardholders’ behavior is a key to reduce the charge off risk of credit card issuers. The most commonly used methods in predicting credit card defaulters are credit scoring models. Most of these credit scoring models use supervised classification methods. Although these methods have made considerable progress in bankruptcy prediction, they are unsuitable for data records without predefined class labels. Therefore, it is worthwhile to investigate the applicability of unsupervised learning methods in credit card accounts classification. The objectives of this paper are: (1) to explore an unsupervised learning method: cluster analysis, for credit card accounts classification, (2) to improve clustering classification results using ensemble and supervised learning methods. In particular, a general purpose clustering toolkit, CLUTO, from university of Minnesota, was used to classify a real-life credit card dataset and two supervised classification methods, decision tree and multiple-criteria linear programming (MCLP), were used to improve the clustering results. The classification results indicate that clustering can be used to either as a stand-alone classification method or as a preprocess step for supervised classification methods.