Preprocessing time series data for classification with application to CRM

Authors:
Yiming Yang;Qiang Yang;Wei Lu;Jialin Pan;Rong Pan;Chenhui Lu;Lei Li;Zhenxing Qin
Affiliations:
Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Department of Computer Science, Hong Kong University of Science and Technology, Kowloon, Hong Kong, China;Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Department of Computer Science, Hong Kong University of Science and Technology, Kowloon, Hong Kong, China;Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Software Institute, Zhongshan University, Guangzhou, Guangdong Province, China;Faculty of Information Technology, University of Technology, Broadway, Sydney, NSW, Australia
Venue:
AI'05 Proceedings of the 18th Australian Joint conference on Advances in Artificial Intelligence
Year:
2005

Citing 10
Cited 2

MetaCost: a general method for making classifiers cost-sensitive

KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Visualization of navigation patterns on a Web site using model-based clustering

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Learning and making decisions when costs and probabilities are both unknown

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A Simple Generalisation of the Area Under the ROC Curve for Multiple Class Classification Problems

Machine Learning
Mining Sequential Patterns

ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
Data Mining of User Navigation Patterns

WEBKDD '99 Revised Papers from the International Workshop on Web Usage Analysis and User Profiling
Mining Customer Value: From Association Rules to Direct Marketing

Data Mining and Knowledge Discovery
AUC: a statistically consistent and more discriminating measure than accuracy

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
The foundations of cost-sensitive learning

IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
A probabilistic approach to navigation in Hypertext

Information Sciences: an International Journal

Classifying execution times in parallel computing systems: a classical hypothesis testing approach

CIARP'11 Proceedings of the 16th Iberoamerican Congress conference on Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Nearest-neighbor-based approach to time-series classification

Decision Support Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We develop an innovative data preprocessing algorithm for classifying customers using unbalanced time series data. This problem is directly motivated by an application whose aim is to uncover the customers’ churning behavior in the telecommunication industry. We model this problem as a sequential classification problem, and present an effective solution for solving the challenging problem, where the elements in the sequences are of a multi-dimensional nature, the sequences are uneven in length and classes of the data are highly unbalanced. Our solution is to integrate model based clustering and develop an innovative data preprocessing algorithm for the time series data. In this paper, we provide the theory and algorithms for the task, and empirically demonstrate that the method is effective in determining the customer class for CRM applications in the telecommunications industry.