Dynamic integration of classifiers for handling concept drift

Authors:
Alexey Tsymbal;Mykola Pechenizkiy;Pádraig Cunningham;Seppo Puuronen
Affiliations:
Siemens AG, Günther-Scharowsky-Str. 1, 91058 Erlangen, Germany;Information Systems Group, Department of Computer Science, Eindhoven University of Technology, P.O. Box 513, 5600 MB Eindhoven, The Netherlands and Faculty of Information Technology, University of ...;School of Computer Science and Informatics, University College, Dublin 4, Ireland;Faculty of Information Technology, University of Jyväskylä, P.O. Box 35, Jyväskylä 40351, Finland
Venue:
Information Fusion
Year:
2008

Abstract

In the real world, concepts are often not stable but change over time. A typical example in the biomedical context is antibiotic resistance, where pathogen sensitivity may change as new pathogen strains develop resistance to antibiotics that were previously effective. This problem, known as concept drift, complicates the task of learning a model from data and requires special approaches, different from commonly used techniques that treat all arriving instances as equally important contributors to the final concept. The underlying data distribution may change as well, making previously built models useless; this is known as virtual concept drift. Both types of concept drift make regular updates of the model necessary. Among the most popular and effective approaches to handling concept drift is ensemble learning, where a set of models built over different time periods is maintained and either the best model is selected or the predictions of the models are combined, usually according to their level of expertise regarding the current concept. In this paper we propose the use of an ensemble integration technique that helps to handle concept drift better at the instance level. In dynamic integration of classifiers, each base classifier is given a weight proportional to its local accuracy with regard to the instance tested, and either the best base classifier is selected or the classifiers are integrated using weighted voting. Our experiments with synthetic data sets simulating abrupt and gradual concept drift and with a real-world antibiotic resistance data set demonstrate that dynamic integration of classifiers built over small time intervals or fixed-sized data blocks can be significantly better than majority voting and weighted voting, which are currently the most commonly used integration techniques for handling concept drift with ensembles.
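
The dynamic integration idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not say how local accuracy is estimated, so the sketch assumes an unweighted Euclidean k-nearest-neighbour neighbourhood over a labelled validation set. All function names, the two constant "block" models, and the toy data are hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


def local_accuracies(classifiers, validation, x, k=3):
    """Accuracy of each classifier on the k validation points nearest to x.

    Assumption: plain Euclidean k-NN defines the local neighbourhood.
    """
    neighbours = sorted(validation, key=lambda pair: math.dist(pair[0], x))[:k]
    return [
        sum(clf(nx) == ny for nx, ny in neighbours) / len(neighbours)
        for clf in classifiers
    ]


def dynamic_selection(classifiers, validation, x, k=3):
    """Predict with the single classifier that is locally most accurate."""
    acc = local_accuracies(classifiers, validation, x, k)
    best = max(range(len(classifiers)), key=acc.__getitem__)
    return classifiers[best](x)


def dynamic_voting(classifiers, validation, x, k=3):
    """Weighted vote in which each weight is the classifier's local accuracy."""
    acc = local_accuracies(classifiers, validation, x, k)
    votes = defaultdict(float)
    for clf, weight in zip(classifiers, acc):
        votes[clf(x)] += weight
    return max(votes, key=votes.get)


# Hypothetical stand-ins for base classifiers built on different data blocks.
def old_block_model(x):
    return 0  # still correct where the concept has not changed


def recent_block_model(x):
    return 1  # correct only in the region where the concept has drifted


models = [old_block_model, recent_block_model]

# Toy labelled data reflecting the current concept: label 0 on the left,
# label 1 on the right of a one-dimensional feature space.
validation = [
    ((0.1,), 0), ((0.2,), 0), ((0.3,), 0),
    ((0.7,), 1), ((0.8,), 1), ((0.9,), 1),
]

print(dynamic_selection(models, validation, (0.15,)))  # 0
print(dynamic_selection(models, validation, (0.85,)))  # 1
print(dynamic_voting(models, validation, (0.85,)))     # 1
```

In this toy setting neither model is accurate everywhere, so a single global weight per model cannot fit both regions. Weighting per test instance lets each model be used where it is locally reliable, which is the instance-level behaviour the abstract describes.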