A semi-supervised approach for reject inference in credit scoring using SVMs

Authors:
Sebastián Maldonado;Gonzalo Paredes
Affiliations:
Department of Industrial Engineering, University of Chile;Department of Industrial Engineering, University of Chile
Venue:
ICDM'10 Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects
Year:
2010

Citing 11
Cited 0

A theory of the learnable

Communications of the ACM
On the exponential value of labeled samples

Pattern Recognition Letters
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Analyzing the effectiveness and applicability of co-training

Proceedings of the ninth international conference on Information and knowledge management
Transductive Inference for Text Classification using Support Vector Machines

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Trading convexity for scalability

ICML '06 Proceedings of the 23rd international conference on Machine learning
Credit Risk Scorecards: Developing And Implementing Intelligent Credit Scoring

Credit Risk Scorecards: Developing And Implementing Intelligent Credit Scoring
Co-training for predicting emotions with spoken dialogue data

ACLdemo '04 Proceedings of the ACL 2004 on Interactive poster and demonstration sessions
Two-view feature generation model for semi-supervised learning

Proceedings of the 24th international conference on Machine learning
A wrapper method for feature selection using Support Vector Machines

Information Sciences: an International Journal
Semi-Supervised Learning

Semi-Supervised Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper presents a novel semi-supervised approach that determines a linear predictor using Support Vector Machines (SVMs) and incorporates information on rejected loans, assuming that the labeled data (accepted applicants) and unlabeled data (rejected applicants) are not drawn from the same distribution. We use a self-training algorithm in order to predict how likely a rejected applicant would have repaid had the applicant received credit. A modification to the self-training algorithm based on Platt's probabilistic output for SVMs is introduced. Experiments with two toy data sets; one well-known benchmark Credit Scoring data set, and one project performed for a Chilean financial institution demonstrate that our approach accomplishes the best classification performance compared to well-known reject inference alternatives and another state-of-the-art semi-supervised method for SVMs (Transductive SVM).