Learning in the presence of concept drift and hidden contexts
Machine Learning
Separate-and-Conquer Rule Learning
Artificial Intelligence Review
Tutorial on Practical Prediction Theory for Classification
The Journal of Machine Learning Research
A Discretization Algorithm Based on a Heterogeneity Criterion
IEEE Transactions on Knowledge and Data Engineering
Google news personalization: scalable online collaborative filtering
Proceedings of the 16th international conference on World Wide Web
Bayesian finite mixtures with an unknown number of components: The allocation sampler
Statistics and Computing
Information integration in the enterprise
Communications of the ACM - Enterprise information integration: and other tools for merging data
Methodologies for data quality assessment and improvement
ACM Computing Surveys (CSUR)
Issues in evaluation of stream learning algorithms
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Handbook of Statistical Analysis and Data Mining Applications
Handbook of Statistical Analysis and Data Mining Applications
Adaptive Learning from Evolving Data Streams
IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
The WEKA data mining software: an update
ACM SIGKDD Explorations Newsletter
Adaptive concept drift detection
Statistical Analysis and Data Mining - Best of SDM'09
Change Mining of Customer Profiles Based on Transactional Data
ICDMW '11 Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops
Bias management of bayesian network classifiers
DS'05 Proceedings of the 8th international conference on Discovery Science
Customer relationship management and Web mining: the next frontier
Data Mining and Knowledge Discovery
Hi-index | 0.01 |
Customer profiles are, by definition, made up of factual and transactional data. It is often the case that due to reasons such as high cost of data acquisition and/or protection, only the transactional data are available for data mining operations. Transactional data, however, tend to be highly sparse and skewed due to a large proportion of customers engaging in very few transactions. This can result in a bias in the prediction accuracy of classifiers built using them. The problem is even more so when identifying and classifying changing customer profiles whose classification may change either due to a concept drift or due to a change in buying behaviour. This paper presents a comparative investigation of 4 approaches for classifying dynamic customer profiles built using evolving transactional data over time. The changing class values of the customer profiles were analysed together with the challenging problem of deciding whether to change the class label or adapt the classifier. The results from the experiments we conducted on a highly sparse and skewed real-world transactional data show that adapting the classifiers leads to more stable classification of customer profiles in the shorter time windows; while relabelling the changed customer profile classes leads to more accurate and stable classification in the longer time windows.