Editorial: Classifying text streams by keywords using classifier ensemble
Data & Knowledge Engineering
Mining Recurring Concept Drifts with Limited Labeled Streaming Data
ACM Transactions on Intelligent Systems and Technology (TIST)
Hi-index | 0.00 |
Mining data streams has attracted much attention recently. Labeled samples needed by most current stream classification methods are more difficult and expensive to obtain than unlabeled ones. This paper proposed a semisupervised learning algorithm - clustering-training to utilize the unlabeled samples. It uses clustering to select confidently unlabeled samples, and uses them to re-train the classifier incrementally. Experiments on synthetic and real data set showed the effectiveness of the proposed algorithm.