An instance-window based classification algorithm for handling gradual concept drifts

Authors:
Vahida Attar;Prashant Chaudhary;Sonali Rahagude;Gaurish Chaudhari;Pradeep Sinha
Affiliations:
College of Engineering, Pune (CoEP), Pune, India;College of Engineering, Pune (CoEP), Pune, India;College of Engineering, Pune (CoEP), Pune, India;College of Engineering, Pune (CoEP), Pune, India;Centre for Development of Advanced Computing (C-DAC), Pune, India
Venue:
ADMI'11 Proceedings of the 7th international conference on Agents and Data Mining Interaction
Year:
2011

Citing 10
Cited 0

Experimental comparisons of online and batch versions of bagging and boosting

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A streaming ensemble algorithm (SEA) for large-scale classification

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Online Ensemble Learning: An Empirical Study

Machine Learning
Mining concept-drifting data streams using ensemble classifiers

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Dynamic integration of classifiers for handling concept drift

Information Fusion
Negative correlation in incremental learning

Natural Computing: an international journal
Agent Mining: The Synergy of Agents and Data Mining

IEEE Intelligent Systems
Tracking recurring contexts using ensemble classifiers: an application to email filtering

Knowledge and Information Systems
The Impact of Diversity on Online Ensemble Learning in the Presence of Concept Drift

IEEE Transactions on Knowledge and Data Engineering
Learn++: an incremental learning algorithm for supervised neuralnetworks

IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews

Quantified Score

Hi-index	0.00

Visualization

Abstract

Mining concept drifting data stream is a challenging area for data mining research. In real world, data streams are not stable but change with time. Such changes termed as drifts in concept of the data stream are categorized into gradual and abrupt, based on the amount of drifting time, i.e. the time steps taken to replace the old concept completely by the new one. In traditional online learning systems, this categorization has not been exploited in developing different approaches for handling different types of drifts in the data stream. Such handling of concept drifts according to their type can help improve the performance of the classification system and hence, the issue can be explored further. Among the most popular and effective approaches to handle concept drifts is ensemble learning, where a set of models built over different time periods is maintained and the predictions of models are combined, usually according to their expertise level regarding the current concept. If early instances of new concept are stored and used for ensemble learning once the drift is detected, this may help increase the overall accuracy after the drift. Moreover, if an ensemble learns with zero diversity for instances of a new concept during the drifting period, the ensemble may learn the new concept faster, thus boosting recovery. The paper presents the above mentioned approach for effective handling of gradual concept drifts in the data streams.