Mining concept-drifting data streams using ensemble classifiers

Authors:
Haixun Wang; Wei Fan; Philip S. Yu; Jiawei Han
Affiliations:
IBM T. J. Watson Research, Hawthorne, NY; IBM T. J. Watson Research, Hawthorne, NY; IBM T. J. Watson Research, Hawthorne, NY; Univ. of Illinois, Urbana, IL
Venue:
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2003

Citing 19
Cited 225

Neural networks and the bias/variance dilemma

Neural Computation
C4.5: programs for machine learning

C4.5: programs for machine learning
BOAT—optimistic decision tree construction

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Mining high-speed data streams

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Space-efficient online computation of quantile summaries

SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
Mining time-changing data streams

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
A streaming ensemble algorithm (SEA) for large-scale classification

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Models and issues in data stream systems

Proceedings of the twenty-first ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Continually evaluating similarity-based pattern queries on a streaming time series

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Continuous queries over data streams

ACM SIGMOD Record
Incremental Induction of Decision Trees

Machine Learning
A Unified Bias-Variance Decomposition and its Applications

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
SPRINT: A Scalable Parallel Classifier for Data Mining

VLDB '96 Proceedings of the 22nd International Conference on Very Large Data Bases
Pruning and dynamic scheduling of cost-sensitive ensembles

Eighteenth national conference on Artificial intelligence
Clustering data streams

FOCS '00 Proceedings of the 41st Annual Symposium on Foundations of Computer Science
Progressive Modeling

ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
Multi-dimensional regression analysis of time-series data streams

VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Inductive learning in less than one sequential data scan

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence

Cost-efficient mining techniques for data streams

ACSW Frontiers '04 Proceedings of the second workshop on Australasian information security, Data Mining and Web Intelligence, and Software Internationalisation - Volume 32
Systematic data selection to mine concept-drifting data streams

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Goal-oriented methods and meta methods for document classification and their parameter tuning

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Finding hot query patterns over an XQuery stream

The VLDB Journal — The International Journal on Very Large Data Bases
A native extension of SQL for mining data streams

Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Incremental rule learning based on example nearness from numerical data streams

Proceedings of the 2005 ACM symposium on Applied computing
Learning decision trees from dynamic data streams

Proceedings of the 2005 ACM symposium on Applied computing
Combining proactive and reactive predictions for data streams

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Streaming pattern discovery in multiple time-series

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Loadstar: load shedding in data stream mining

VLDB '05 Proceedings of the 31st international conference on Very large data bases
Mining data streams: a review

ACM SIGMOD Record
Time weight collaborative filtering

Proceedings of the 14th ACM international conference on Information and knowledge management
Tracking concept drifting with an online-optimized incremental learning framework

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Stream Cube: An Architecture for Multi-Dimensional Analysis of Data Streams

Distributed and Parallel Databases
Research issues in data stream association rule mining

ACM SIGMOD Record
A Framework for On-Demand Classification of Evolving Data Streams

IEEE Transactions on Knowledge and Data Engineering
Data streams classification by incremental rule learning with parameterized generalization

Proceedings of the 2006 ACM symposium on Applied computing
Suppressing model overfitting in mining concept-drifting data streams

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Recency-based collaborative filtering

ADC '06 Proceedings of the 17th Australasian Database Conference - Volume 49
Effective classification of noisy data streams with attribute-oriented dynamic classifier selection

Knowledge and Information Systems
An automatic construction and organization strategy for ensemble learning on data streams

ACM SIGMOD Record
Classification spanning correlated data streams

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Voting with a parameterized veto strategy: solving the KDD Cup 2006 problem by means of a classifier committee

ACM SIGKDD Explorations Newsletter
Decision trees for mining data streams

Intelligent Data Analysis
Effective variation management for pseudo periodical streams

Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Quality-Aware Sampling and Its Applications in Incremental Data Mining

IEEE Transactions on Knowledge and Data Engineering
Real-time ranking with concept drift using expert advice

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Cross-domain video concept detection using adaptive SVMs

Proceedings of the 15th international conference on Multimedia
A framework for generating data to simulate changing environments

AIAP'07 Proceedings of the 25th IASTED International Multi-Conference: artificial intelligence and applications
Dynamic integration of classifiers for handling concept drift

Information Fusion
StreamMiner: a classifier ensemble-based engine to mine concept-drifting data streams

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Boolean representation based data-adaptive correlation analysis over time series streams

Proceedings of the sixteenth ACM conference on information and knowledge management
Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace

ACM Transactions on Information Systems (TOIS)
Non-stationary data sequence classification using online class priors estimation

Pattern Recognition
Boosting classifiers for drifting concepts

Intelligent Data Analysis - Knowledge Discovery from Data Streams
Statistical supports for mining sequential patterns and improving the incremental update process on data streams

Intelligent Data Analysis - Knowledge Discovery from Data Streams
Collaborative filtering on streaming data with interest-drifting

Intelligent Data Analysis - Knowledge Discovery from Data Streams
An active learning system for mining time-changing data streams

Intelligent Data Analysis
Efficient instance-based learning on data streams

Intelligent Data Analysis
Designing an inductive data stream management system: the stream mill experience

SSPS '08 Proceedings of the 2nd international workshop on Scalable stream processing system
Approximate mining of maximal frequent itemsets in data streams with different window models

Expert Systems with Applications: An International Journal
Dynamic Weighted Majority: An Ensemble Method for Drifting Concepts

The Journal of Machine Learning Research
Gene ontology annotation as text categorization: An empirical study

Information Processing and Management: an International Journal
Knowledge transfer via multiple model local structure mapping

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Categorizing and mining concept drifting data streams

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Meta methods for model sharing in personal information systems

ACM Transactions on Information Systems (TOIS)
Improving the performance of an incremental algorithm driven by error margins

Intelligent Data Analysis - Knowledge Discovery from Data Streams
Incremental tensor analysis: Theory and applications

ACM Transactions on Knowledge Discovery from Data (TKDD)
Info-fuzzy algorithms for mining dynamic data streams

Applied Soft Computing
Peer to peer botnet detection for cyber-security: a data mining approach

Proceedings of the 4th annual workshop on Cyber security and information intelligence research: developing strategies to meet the cyber security and information intelligence challenges ahead
An Efficient and Sensitive Decision Tree Approach to Mining Concept-Drifting Data Streams

Informatica
Distributed mining of censored production rules in data streams: an evolutionary approach

AIKED'08 Proceedings of the 7th WSEAS International Conference on Artificial intelligence, knowledge engineering and data bases
An Incremental Fuzzy Decision Tree Classification Method for Mining Data Streams

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
DELAY: A Lazy Approach for Mining Frequent Patterns over High Speed Data Streams

ADMA '07 Proceedings of the 3rd international conference on Advanced Data Mining and Applications
Conceptual equivalence for contrast mining in classification learning

Data & Knowledge Engineering
Conceptual modeling rules extracting for data streams

Knowledge-Based Systems
Mining decision rules on data streams in the presence of concept drifts

Expert Systems with Applications: An International Journal
Boosting and measuring the performance of ensembles for a successful database marketing

Expert Systems with Applications: An International Journal
Unsupervised Classifier Selection Based on Two-Sample Test

DS '08 Proceedings of the 11th International Conference on Discovery Science
Combining Online Classification Approaches for Changing Environments

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Class Specific Fuzzy Decision Trees for Mining High Speed Data Streams

Fundamenta Informaticae
Incrementally Mining Recently Repeating Patterns over Data Streams

New Frontiers in Applied Data Mining
Adaptive correlation analysis in stream time series with sliding windows

Computers & Mathematics with Applications
Indexing density models for incremental learning and anytime classification on data streams

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Self-tuning query mesh for adaptive multi-route query processing

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Pruning an ensemble of classifiers via reinforcement learning

Neurocomputing
Intervention Events Detection and Prediction in Data Streams

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
A Multi-partition Multi-chunk Ensemble Technique to Classify Concept-Drifting Data Streams

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
CBDT: A Concept Based Approach to Data Stream Mining

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
An Aggregate Ensemble for Mining Concept Drifting Data Streams with Noise

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
PGG: an online pattern based approach for stream variation management

Journal of Computer Science and Technology
Flexible decision tree for data stream classification in the presence of concept change, noise and missing values

Data Mining and Knowledge Discovery
Collaborative filtering with temporal dynamics

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
A Cascade Multiple Classifier System for Document Categorization

MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
Online phishing classification using adversarial data mining and signaling games

Proceedings of the ACM SIGKDD Workshop on CyberSecurity and Intelligence Informatics
OcVFDT: one-class very fast decision tree for one-class classification of data streams

Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data
Drift-Aware Ensemble Regression

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Harnessing the strengths of anytime algorithms for constant data streams

Data Mining and Knowledge Discovery
Combining Time and Space Similarity for Small Size Learning under Concept Drift

ISMIS '09 Proceedings of the 18th International Symposium on Foundations of Intelligent Systems
Lacking Labels in the Stream: Classifying Evolving Stream Data with Few Labels

ISMIS '09 Proceedings of the 18th International Symposium on Foundations of Intelligent Systems
Integrating Novel Class Detection with Classification for Concept-Drifting Data Streams

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part II
History Guided Low-Cost Change Detection in Streams

DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Ambiguous decision trees for mining concept-drifting data streams

Pattern Recognition Letters
Multivariable stream data classification using motifs and their temporal relations

Information Sciences: an International Journal
Concept sampling: towards systematic selection in large-scale mixed concepts in machine learning

IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
Detect and track latent factors with online nonnegative matrix factorization

IJCAI'07 Proceedings of the 20th international joint conference on Artificial intelligence
Machine learning in disruption-tolerant MANETs

ACM Transactions on Autonomous and Adaptive Systems (TAAS)
Mining data streams with periodically changing distributions

Proceedings of the 18th ACM conference on Information and knowledge management
Graph-based transfer learning

Proceedings of the 18th ACM conference on Information and knowledge management
Enhancing recommender systems under volatile user interest drifts

Proceedings of the 18th ACM conference on Information and knowledge management
Ensembles in adversarial classification for spam

Proceedings of the 18th ACM conference on Information and knowledge management
HE-Tree: a framework for detecting changes in clustering structure for categorical data streams

The VLDB Journal — The International Journal on Very Large Data Bases
Indexing ICD-9 codes for free-textual clinical diagnosis records by a new ensemble classifier

International Journal of Computational Intelligence in Bioinformatics and Systems Biology
Tracking Recurring Concepts with Meta-learners

EPIA '09 Proceedings of the 14th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Transfer Learning beyond Text Classification

ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Mining Multi-label Concept-Drifting Data Streams Using Dynamic Classifier Ensemble

ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Learning, detecting, understanding, and predicting concept changes

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Incremental learning in nonstationary environments with controlled forgetting

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
SERA: selectively recursive approach towards nonstationary imbalanced stream data mining

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Efficient decision tree construction for mining time-varying data streams

CASCON '09 Proceedings of the 2009 Conference of the Center for Advanced Studies on Collaborative Research
Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams

Proceedings of the 2010 conference on Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
Towards incremental classifier fusion

Intelligent Data Analysis
A case-based technique for tracking concept drift in spam filtering

Knowledge-Based Systems
Mining distributed evolving data streams using fractal GP ensembles

EuroGP'07 Proceedings of the 10th European conference on Genetic programming
Incremental learning of support vector machines by classifier combining

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Online rare events detection

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Analysis of firewall policy rules using traffic mining techniques

International Journal of Internet Protocol Technology
A new learning strategy for classification problems with different training and test distributions

IWANN'07 Proceedings of the 9th international work conference on Artificial neural networks
Quick adaptation to changing concepts by sensitive detection

IEA/AIE'07 Proceedings of the 20th international conference on Industrial, engineering, and other applications of applied intelligent systems
An efficient algorithm for instance-based learning on data streams

ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
Incremental learning with multiple classifier systems using correction filters for classification

IDA'07 Proceedings of the 7th international conference on Intelligent data analysis
A new fuzzy decision tree classification method for mining high-speed data streams based on binary search trees

FAW'07 Proceedings of the 1st annual international conference on Frontiers in algorithmics
Detecting concept drift using statistical testing

DS'07 Proceedings of the 10th international conference on Discovery science
A new decision tree classification method for mining high-speed data streams based on threaded binary search trees

PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
To better handle concept change and noise: a cellular automata approach to data stream classification

AI'07 Proceedings of the 20th Australian joint conference on Advances in artificial intelligence
Mining multi-label concept-drifting data streams using ensemble classifiers

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
An algorithmic approach to event summarization

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Transfer estimation of evolving class priors in data stream classification

Pattern Recognition
CALDS: context-aware learning from data streams

Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
λ-Perceptron: An adaptive classifier for data streams

Pattern Recognition
Partial drift detection using a rule induction framework

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Tracking recurrent concepts using context

RSCTC'10 Proceedings of the 7th international conference on Rough sets and current trends in computing
Adaptive methods for classification in arbitrarily imbalanced and drifting data streams

PAKDD'09 Proceedings of the 13th Pacific-Asia international conference on Knowledge discovery and data mining: new frontiers in applied data mining
On classifying drifting concepts in P2P networks

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Classification and novel class detection of data streams in a dynamic feature space

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part II
Robust ensemble learning for mining noisy data streams

Decision Support Systems
Active learning from stream data using optimal weight classifier ensemble

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Mining concept-drifting data streams containing labeled and unlabeled instances

IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part I
Building a new classifier in an ensemble using streaming unlabeled data

IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part II
Incremental mining of closed inter-transaction itemsets over data stream sliding windows

Journal of Information Science
Detecting movement patterns with wireless sensor networks: application to bird behavior

Proceedings of the 8th International Conference on Advances in Mobile Computing and Multimedia
On-line learning: where are we so far?

Ubiquitous knowledge discovery
Learning recurring concepts from data streams with a context-aware ensemble

Proceedings of the 2011 ACM Symposium on Applied Computing
Efficient decision tree re-alignment for clustering time-changing data streams

From active data management to event-based systems and more
A robust incremental learning method for non-stationary environments

Neurocomputing
Finding semantics in time series

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Gas sensor drift mitigation using classifier ensembles

Proceedings of the Fifth International Workshop on Knowledge Discovery from Sensor Data
Precise anytime clustering of noisy sensor data with logarithmic complexity

Proceedings of the Fifth International Workshop on Knowledge Discovery from Sensor Data
Editorial: Classifying text streams by keywords using classifier ensemble

Data & Knowledge Engineering
Effective sentiment stream analysis with self-augmenting training and demand-driven projection

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Random ensemble decision trees for learning concept-drifting data streams

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
A Cluster-Based Context-Tree Model for Multivariate Data Streams with Applications to Anomaly Detection

INFORMS Journal on Computing
Cloud-based malware detection for evolving data streams

ACM Transactions on Management Information Systems (TMIS)
Enabling fast prediction for ensemble models on data streams

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Accuracy updated ensemble for data streams with concept drift

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
Batch weighted ensemble for mining data streams with concept drift

ISMIS'11 Proceedings of the 19th international conference on Foundations of intelligent systems
Concurrent semi-supervised learning of data streams

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Tracking concept change with incremental boosting by minimization of the evolving exponential loss

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Mining data streams with concept drifts using genetic algorithm

Artificial Intelligence Review
SCENT: Scalable compressed monitoring of evolving multirelational social networks

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP) - Special section on ACM multimedia 2010 best paper candidates, and issue on social media
Beating the baseline prediction in food sales: How intelligent an intelligent predictor is?

Expert Systems with Applications: An International Journal
An efficient continuous attributes handling method for mining concept-drifting data streams based on skip list

AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part I
A clustering algorithm for multiple data streams based on spectral component similarity

Information Sciences: an International Journal
Pattern change discovery between high dimensional data sets

Proceedings of the 20th ACM international conference on Information and knowledge management
Context-aware collaborative data stream mining in ubiquitous devices

IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Knowledge maintenance on data streams with concept drifting

CIS'04 Proceedings of the First international conference on Computational and Information Science
Incremental algorithm driven by error margins

DS'06 Proceedings of the 9th international conference on Discovery Science
Adaptive classifier selection based on two level hypothesis tests for incremental learning

SSPR'06/SPR'06 Proceedings of the 2006 joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
Classifying noisy data streams

FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
An adaptive nearest neighbor classification algorithm for data streams

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
A random method for quantifying changing distributions in data streams

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Improving the performance of data stream classifiers by mining recurring contexts

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
ACE: adaptive classifiers-ensemble system for concept-drifting environments

MCS'05 Proceedings of the 6th international conference on Multiple Classifier Systems
Classification and novel class detection in data streams with active mining

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Using restrictive classification and meta classification for junk elimination

ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
Multivariate stream data classification using simple text classifiers

DEXA'06 Proceedings of the 17th international conference on Database and Expert Systems Applications
A scalable supervised algorithm for dimensionality reduction on streaming data

Information Sciences: an International Journal
Handling different categories of concept drifts in data streams using distributed GP

EuroGP'10 Proceedings of the 13th European conference on Genetic Programming
Detecting change via competence model

ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Classifier ensemble for uncertain data stream classification

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Stock fraud detection using peer group analysis

Expert Systems with Applications: An International Journal
Mining databases and data streams with query languages and rules

KDID'05 Proceedings of the 4th international conference on Knowledge Discovery in Inductive Databases
Automatic document organization in a p2p environment

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Mining uncertain data streams using clustering feature decision trees

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
An instance-window based classification algorithm for handling gradual concept drifts

ADMI'11 Proceedings of the 7th international conference on Agents and Data Mining Interaction
Learning from concept drifting data streams with unlabeled data

Neurocomputing
A framework for application-driven classification of data streams

Neurocomputing
Data with shifting concept classification using simulated recurrence

ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part I
Evolutionary adapted ensemble for reoccurring context

HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part II
Online linear and quadratic discriminant analysis with adaptive forgetting for streaming classification

Statistical Analysis and Data Mining
Learning decision rules from data streams

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Two
Kernel-based selective ensemble learning for streams of trees

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Two
Biclustering-driven ensemble of Bayesian belief network classifiers for underdetermined problems

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Two
2011 Special Issue: A just-in-time adaptive classification system based on the intersection of confidence intervals rule

Neural Networks
Event-based classification of social media streams

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Learning very fast decision tree from uncertain data streams with positive and unlabeled samples

Information Sciences: an International Journal
A double-ensemble approach for classifying skewed data streams

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Heterogeneous ensemble for feature drifts in data streams

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Unexpected challenges in large scale machine learning

Proceedings of the 1st International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
BT*: an advanced algorithm for anytime classification

SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
Bayesian approach to the concept drift in the pattern recognition problems

MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Class Specific Fuzzy Decision Trees for Mining High Speed Data Streams

Fundamenta Informaticae
Ensemble approaches for regression: A survey

ACM Computing Surveys (CSUR)
Fuzzy based privacy preserving classification of data streams

Proceedings of the CUBE International Information Technology Conference
Data stream classification with artificial endocrine system

Applied Intelligence
A new method of mining data streams using harmony search

Journal of Intelligent Information Systems
An attempt to employ genetic fuzzy systems to predict from a data stream of premises transactions

SUM'12 Proceedings of the 6th international conference on Scalable Uncertainty Management
Handling time changing data with adaptive very fast decision rules

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Next challenges for adaptive learning systems

ACM SIGKDD Explorations Newsletter
Online predictive model for taxi services

IDA'12 Proceedings of the 11th international conference on Advances in Intelligent Data Analysis
Batch-incremental versus instance-incremental learning in dynamic and evolving data

IDA'12 Proceedings of the 11th international conference on Advances in Intelligent Data Analysis
A rank-one update method for least squares linear discriminant analysis with concept drift

Pattern Recognition
An analysis of change trends by predicting from a data stream using genetic fuzzy systems

ICCCI'12 Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part I
A data-mining approach to preference-based data ranking founded on contextual information

Information Systems
An ensemble clustering model for mining concept drifting stream data in emergency management

DM-IKM '12 Proceedings of the Data Mining and Intelligent Knowledge Management Workshop
Dynamic multi-objective evolution of classifier ensembles for video face recognition

Applied Soft Computing
RCD: A recurring concept drift framework

Pattern Recognition Letters
Real time processing of data from patient biodevices

HIKM '11 Proceedings of the Fourth Australasian Workshop on Health Informatics and Knowledge Management - Volume 120
Recentness biased learning for time series forecasting

Information Sciences: an International Journal
An incremental learning algorithm based on the K-associated graph for non-stationary data classification

Information Sciences: an International Journal
Learning from data streams with only positive and unlabeled data

Journal of Intelligent Information Systems
A survey on concept drift adaptation

ACM Computing Surveys (CSUR)
Mining Data Streams with Skewed Distribution based on Ensemble Method

International Journal of Advanced Pervasive and Ubiquitous Computing
Predicting knowledge in an ontology stream

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
A survey of multiple classifier systems as hybrid systems

Information Fusion
STAR-CITY: semantic traffic analytics and reasoning for CITY

Proceedings of the 19th international conference on Intelligent User Interfaces
MetaStream: A meta-learning based method for periodic algorithm selection in time-changing data

Neurocomputing
Combining block-based and online methods in learning ensembles from concept drifting data streams

Information Sciences: an International Journal
Concept drift detection via competence models

Artificial Intelligence
Design and Implementation of a Data Mining System for Malware Detection

Journal of Integrated Design & Process Science
Classifying evolving data streams with partially labeled data

Intelligent Data Analysis
Tracking recurrent concepts using context

Intelligent Data Analysis - Combined Learning Methods and Mining Complex Data

Quantified Score

Hi-index	0.01

Abstract

Mining data streams with concept drifts for actionable insights has recently become an important and challenging task for a wide range of applications, including credit card fraud protection, target marketing, and network intrusion detection. Conventional knowledge discovery tools face two challenges: the overwhelming volume of the streaming data and the concept drifts. In this paper, we propose a general framework for mining concept-drifting data streams using weighted ensemble classifiers. We train an ensemble of classification models, such as C4.5, RIPPER, and naive Bayesian, from sequential chunks of the data stream. The classifiers in the ensemble are judiciously weighted based on their expected classification accuracy on the test data under the time-evolving environment. Thus, the ensemble approach improves both the efficiency of learning the model and the accuracy of classification. Our empirical study shows that the proposed methods have a substantial advantage over single-classifier approaches in prediction accuracy, and that the ensemble framework is effective for a variety of classification models.
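
The chunk-based weighting idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several assumptions: the names `ThresholdStump`, `WeightedChunkEnsemble`, `accuracy`, and `max_members` are hypothetical, a toy one-dimensional threshold rule stands in for learners such as C4.5 or RIPPER, and each member's accuracy on the most recent chunk is used as a simple proxy for its expected accuracy on upcoming test data.

```python
class ThresholdStump:
    """Toy base learner on 1-D inputs (a stand-in for C4.5, RIPPER, etc.)."""

    def fit(self, xs, ys):
        best = None
        for t in sorted(set(xs)):
            for polarity in (1, -1):
                hits = sum(self._rule(x, t, polarity) == y for x, y in zip(xs, ys))
                acc = hits / len(ys)
                if best is None or acc > best[0]:
                    best = (acc, t, polarity)
        _, self.threshold, self.polarity = best
        return self

    @staticmethod
    def _rule(x, t, polarity):
        # polarity 1: predict 1 when x >= t; polarity -1: predict 1 when x < t
        return int(x >= t) if polarity == 1 else int(x < t)

    def predict(self, x):
        return self._rule(x, self.threshold, self.polarity)


def accuracy(learner, xs, ys):
    return sum(learner.predict(x) == y for x, y in zip(xs, ys)) / len(ys)


class WeightedChunkEnsemble:
    """Keeps up to max_members learners, each trained on one chunk of the stream."""

    def __init__(self, make_learner, max_members=5):
        self.make_learner = make_learner
        self.max_members = max_members
        self.members = []  # list of (learner, weight) pairs

    def update(self, xs, ys):
        # Train one new learner on the latest chunk, then reweight every
        # learner by its accuracy on that chunk (assumed proxy for current
        # accuracy). Simplification: the newest learner is scored on its own
        # training data, which is optimistic.
        learners = [m for m, _ in self.members] + [self.make_learner().fit(xs, ys)]
        scored = [(m, accuracy(m, xs, ys)) for m in learners]
        scored.sort(key=lambda pair: pair[1], reverse=True)
        self.members = scored[: self.max_members]

    def predict(self, x):
        # Weighted vote: each member adds its weight to the label it predicts.
        votes = {}
        for learner, weight in self.members:
            label = learner.predict(x)
            votes[label] = votes.get(label, 0.0) + weight
        return max(votes, key=votes.get)


# Usage: the concept flips between the first and second chunk.
ensemble = WeightedChunkEnsemble(ThresholdStump, max_members=2)
chunk_xs = list(range(10))
ensemble.update(chunk_xs, [int(x >= 5) for x in chunk_xs])  # old concept
ensemble.update(chunk_xs, [int(x < 5) for x in chunk_xs])   # drifted concept
print(ensemble.predict(7))
```

In this sketch the learner trained before the drift gets weight zero on the drifted chunk, so the vote follows the newer learner; old members are kept only while they rank among the top `max_members` by weight.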