Reconciling performance and interpretability in customer churn prediction using ensemble learning based on generalized additive models

Authors:
Koen W. De Bock;Dirk Van den Poel
Affiliations:
IESEG School of Management, Université Catholique de Lille (LEM, UMR CNRS 8179), Department of Marketing, 3 Rue de la Digue, F-59000 Lille, France and Ghent University, Faculty of Economics a ...;Ghent University, Faculty of Economics and Business Administration, Department of Marketing, Tweekerkenstraat 2, B-9000 Ghent, Belgium
Venue:
Expert Systems with Applications: An International Journal
Year:
2012

Citing 25
Cited 1

Bagging predictors

Machine Learning
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
An approach to the automatic design of multiple classifier systems

Pattern Recognition Letters - Special issue on machine learning and data mining in pattern recognition
Knowledge management and data mining for marketing

Decision Support Systems - Knowledge management support of decision making
Random Forests

Machine Learning
CHAMP: A Prototype for Automated Cellular Churn Prediction

Data Mining and Knowledge Discovery
The Case against Accuracy Estimation for Comparing Induction Algorithms

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Crafting Papers on Machine Learning

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Combining Pattern Classifiers: Methods and Algorithms

Combining Pattern Classifiers: Methods and Algorithms
Mining with rarity: a unifying framework

ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
BagBoosting for tumor classification with gene expression data

Bioinformatics
Bankruptcy prediction by generalized additive models: Research Articles

Applied Stochastic Models in Business and Industry
Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques

Expert Systems with Applications: An International Journal
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
Improving customer complaint management by automatic email classification using linguistic style features as predictors

Decision Support Systems
Handling class imbalance in customer churn prediction

Expert Systems with Applications: An International Journal
Customer churn prediction using improved balanced random forests

Expert Systems with Applications: An International Journal
Improved marketing decision making in a customer churn prediction context using generalized additive models

Expert Systems with Applications: An International Journal
Predicting customer retention and profitability by using random forests and regression forests techniques

Expert Systems with Applications: An International Journal
Ensemble classification based on generalized additive models

Computational Statistics & Data Analysis
An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction

Expert Systems with Applications: An International Journal
Using generalized additive models for construction of nonlinear classifiers in computer-aided diagnosis systems

IEEE Transactions on Information Technology in Biomedicine
Evolutionary computing for knowledge discovery in medical diagnosis

Artificial Intelligence in Medicine
Predicting subscriber dissatisfaction and improving retention in the wireless telecommunications industry

IEEE Transactions on Neural Networks
Recognizing partially occluded, expression variant faces from single training image per person with SOM and soft k-NN ensemble

IEEE Transactions on Neural Networks

Improved multilevel security with latent semantic indexing

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	12.05

Visualization

Abstract

To build a successful customer churn prediction model, a classification algorithm should be chosen that fulfills two requirements: strong classification performance and a high level of model interpretability. In recent literature, ensemble classifiers have demonstrated superior performance in a multitude of applications and data mining contests. However, due to an increased complexity they result in models that are often difficult to interpret. In this study, GAMensPlus, an ensemble classifier based upon generalized additive models (GAMs), in which both performance and interpretability are reconciled, is presented and evaluated in a context of churn prediction modeling. The recently proposed GAMens, based upon Bagging, the Random Subspace Method and semi-parametric GAMs as constituent classifiers, is extended to include two instruments for model interpretability: generalized feature importance scores, and bootstrap confidence bands for smoothing splines. In an experimental comparison on data sets of six real-life churn prediction projects, the competitive performance of the proposed algorithm over a set of well-known benchmark algorithms is demonstrated in terms of four evaluation metrics. Further, the ability of the technique to deliver valuable insight into the drivers of customer churn is illustrated in a case study on data from a European bank. Firstly, it is shown how the generalized feature importance scores allow the analyst to identify the relative importance of churn predictors in function of the criterion that is used to measure the quality of the model predictions. Secondly, the ability of GAMensPlus to identify nonlinear relationships between predictors and churn probabilities is demonstrated.