Preprocessing time series data for classification with application to CRM

  • Authors:
  • Yiming Yang;Qiang Yang;Wei Lu;Jialin Pan;Rong Pan;Chenhui Lu;Lei Li;Zhenxing Qin

  • Affiliations:
  • Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Department of Computer Science, Hong Kong University of Science and Technology, Kowloon, Hong Kong, China;Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Department of Computer Science, Hong Kong University of Science and Technology, Kowloon, Hong Kong, China;Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Faculty of Information Technology, University of Technology, Broadway, Sydney, NSW, Australia

  • Venue:
  • AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We develop an innovative data preprocessing algorithm for classifying customers using unbalanced time series data. This problem is directly motivated by an application whose aim is to uncover the customers’ churning behavior in the telecommunication industry. We model this problem as a sequential classification problem, and present an effective solution for solving the challenging problem, where the elements in the sequences are of a multi-dimensional nature, the sequences are uneven in length and classes of the data are highly unbalanced. Our solution is to integrate model based clustering and develop an innovative data preprocessing algorithm for the time series data. In this paper, we provide the theory and algorithms for the task, and empirically demonstrate that the method is effective in determining the customer class for CRM applications in the telecommunications industry.