Learning without default: a study of one-class classification and the low-default portfolio problem

  • Authors:
  • Kenneth Kennedy;Brian Mac Namee;Sarah Jane Delany

  • Affiliations:
  • School of Computing, Dublin Institute of Technology, Dublin, Ireland;School of Computing, Dublin Institute of Technology, Dublin, Ireland;Digital Media Centre, Dublin Institute of Technology, Dublin, Ireland

  • Venue:
  • AICS'09 Proceedings of the 20th Irish conference on Artificial intelligence and cognitive science
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper asks at what level of class imbalance one-class classifiers outperform two-class classifiers in credit scoring problems in which class imbalance, referred to as the low-default portfolio problem, is a serious issue. The question is answered by comparing the performance of a variety of one-class and two-class classifiers on a selection of credit scoring datasets as the class imbalance is manipulated. We also include random oversampling as this is one of the most common approaches to addressing class imbalance. This study analyses the suitability and performance of recognised two-class classifiers and one-class classifiers. Based on our study we conclude that the performance of the two-class classifiers deteriorates proportionally to the level of class imbalance. The two-class classifiers outperform one-class classifiers with class imbalance levels down as far as 15% (i.e. the imbalance ratio of minority class to majority class is 15:85). The one-class classifiers, whose performance remains unvaried throughout, are preferred when the minority class constitutes approximately 2% or less of the data. Between an imbalance of 2% to 15% the results are not as conclusive. These results show that one-class classifiers could potentially be used as a solution to the low-default portfolio problem experienced in the credit scoring domain.