Handling concept drift via ensemble and class distribution estimation technique

  • Authors:
  • Nachai Limsetto;Kitsana Waiyamai

  • Affiliations:
  • Data Analysis and Knowledge Discovery Lab (DAKDL) Department of computer engineering Faculty of Engineering, Kasetsart University, Bangkok, Thailand;Data Analysis and Knowledge Discovery Lab (DAKDL) Department of computer engineering Faculty of Engineering, Kasetsart University, Bangkok, Thailand

  • Venue:
  • ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

In real world settings there is situation where class distribution of data may change after classifier is built resulting in performance degradation of classifier. Attempts to solve this problem from previous Class Distribution Estimation method (CDE method) yield quite interesting performance however we notice there is some flaw since CDE method still have some bias toward train data thus we decide to improve them with ensemble method. Our Class Distribution Estimation-Ensemble (CDE-EM) methods estimate class distribution from many models instead of one resulting in less bias than previous method. All methods are evaluated using accuracy on set of benchmark UCI data sets. Experimental results demonstrate that our methods yield better performance if class distribution of test data is different from train data.