Cost-Sensitive-Data Preprocessing for Mining Customer Relationship Management Databases

  • Authors:
  • Junfeng Pan;Qiang Yang;Yiming Yang;Lei Li;Frances Tianyi Li;George Wenmin Li

  • Affiliations:
  • Hong Kong University of Science and Technology;Hong Kong University of Science and Technology;Sun Yat-sen University;Sun Yat-sen University;Guangzhou E-DM Tech Corporation;Guangzhou E-DM Tech Corporation

  • Venue:
  • IEEE Intelligent Systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Telecommunications companies and financial institutions are facing increasing competition. A staged preprocessing framework for cost-sensitive-data processing can help these companies identify customers who might switch to a competitor (or churn). The framework gives users an intuitive idea of the data distribution using a self-organizing map and then uses a cost matrix to help convert the data with an improved equidepth discretization method. The preprocessed data set can be input to any classifier. When tested on the KDD Cup 1998 data set, the framework performed better than the competition's winner. It has also been implemented in a software product called ED-Money and applied to a Chinese mobile telecommunication data set.