A streaming ensemble algorithm (SEA) for large-scale classification

Authors:
W. Nick Street;YongSeog Kim
Affiliations:
University of Iowa, Iowa City, IA;University of Iowa, Iowa City, IA
Venue:
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Year:
2001

Citing 13
Cited 164

The Strength of Weak Learnability

Machine Learning
C4.5: programs for machine learning

C4.5: programs for machine learning
Methods for combining experts' probability assessments

Neural Computation
Learning in the presence of concept drift and hidden contexts

Machine Learning
Bagging predictors

Machine Learning
On the Accuracy of Meta-learning for Scalable Data Mining

Journal of Intelligent Information Systems
Advances in knowledge discovery and data mining

Advances in knowledge discovery and data mining
BOAT—optimistic decision tree construction

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Feature selection for ensembles

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Data selection for support vector machine classifiers

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
Mining high-speed data streams

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Empirical analysis of predictive algorithms for collaborative filtering

UAI'98 Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence

MobiMine: monitoring the stock market from a PDA

ACM SIGKDD Explorations Newsletter
Distributed learning with bagging-like performance

Pattern Recognition Letters
Racing Committees for Large Datasets

DS '02 Proceedings of the 5th International Conference on Discovery Science
Exploiting unlabeled data in ensemble methods

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Feature selection in data mining

Data mining
Dynamic Weighted Majority: A New Ensemble Method for Tracking Concept Drift

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
Mining concept-drifting data streams using ensemble classifiers

Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
A Fourier Spectrum-Based Approach to Represent Decision Trees for Mining Data Streams in Mobile Environments

IEEE Transactions on Knowledge and Data Engineering
Learning Ensembles from Bites: A Scalable and Accurate Approach

The Journal of Machine Learning Research
Genetic programming in classifying large-scale data: an ensemble method

Information Sciences: an International Journal - Special issue: Soft computing data mining
Systematic data selection to mine concept-drifting data streams

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Incremental rule learning based on example nearness from numerical data streams

Proceedings of the 2005 ACM symposium on Applied computing
Learning decision trees from dynamic data streams

Proceedings of the 2005 ACM symposium on Applied computing
Combining proactive and reactive predictions for data streams

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Using additive expert ensembles to cope with concept drift

ICML '05 Proceedings of the 22nd international conference on Machine learning
On Reducing Classifier Granularity in Mining Concept-Drifting Data Streams

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Sharing Classifiers among Ensembles from Related Problem Domains

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Feature Selection for Building Cost-Effective Data Stream Classifiers

ICDM '05 Proceedings of the Fifth IEEE International Conference on Data Mining
Data streams classification by incremental rule learning with parameterized generalization

Proceedings of the 2006 ACM symposium on Applied computing
Suppressing model overfitting in mining concept-drifting data streams

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Orthogonal Decision Trees

IEEE Transactions on Knowledge and Data Engineering
An automatic construction and organization strategy for ensemble learning on data streams

ACM SIGMOD Record
Decision trees for mining data streams

Intelligent Data Analysis
Incremental discretization, application to data with concept drift

Proceedings of the 2007 ACM symposium on Applied computing
Ensemble Pruning Via Semi-definite Programming

The Journal of Machine Learning Research
Real-time ranking with concept drift using expert advice

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A framework for generating data to simulate changing environments

AIAP'07 Proceedings of the 25th conference on Proceedings of the 25th IASTED International Multi-Conference: artificial intelligence and applications
Dynamic integration of classifiers for handling concept drift

Information Fusion
StreamMiner: a classifier ensemble-based engine to mine concept-drifting data streams

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Boosting classifiers for drifting concepts

Intelligent Data Analysis - Knowlegde Discovery from Data Streams
An active learning system for mining time-changing data streams

Intelligent Data Analysis
A case-study on naïve labelling for the nearest mean and the linear discriminant classifiers

Pattern Recognition
Learning in Environments with Unknown Dynamics: Towards more Robust Concept Learners

The Journal of Machine Learning Research
Dynamic Weighted Majority: An Ensemble Method for Drifting Concepts

The Journal of Machine Learning Research
Categorizing and mining concept drifting data streams

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
An Efficient and Sensitive Decision Tree Approach to Mining Concept-Drifting Data Streams

Informatica
Conceptual equivalence for contrast mining in classification learning

Data & Knowledge Engineering
Mining decision rules on data streams in the presence of concept drifts

Expert Systems with Applications: An International Journal
Boosting and measuring the performance of ensembles for a successful database marketing

Expert Systems with Applications: An International Journal
A comparative study of simple online learning strategies for streaming data

WSEAS Transactions on Circuits and Systems
Class Specific Fuzzy Decision Trees for Mining High Speed Data Streams

Fundamenta Informaticae
Indexing density models for incremental learning and anytime classification on data streams

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Pruning an ensemble of classifiers via reinforcement learning

Neurocomputing
Online learning strategies for classification of static data streams

DIWEB'08 Proceedings of the 8th WSEAS international conference on Distance learning and web engineering
An Aggregate Ensemble for Mining Concept Drifting Data Streams with Noise

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Flexible decision tree for data stream classification in the presence of concept change, noise and missing values

Data Mining and Knowledge Discovery
New ensemble methods for evolving data streams

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Issues in evaluation of stream learning algorithms

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Collaborative filtering with temporal dynamics

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Using Ensemble-Based Reasoning to Help Experts in Melanoma Diagnosis

Proceedings of the 2008 conference on Artificial Intelligence Research and Development: Proceedings of the 11th International Conference of the Catalan Association for Artificial Intelligence
A Cascade Multiple Classifier System for Document Categorization

MCS '09 Proceedings of the 8th International Workshop on Multiple Classifier Systems
OcVFDT: one-class very fast decision tree for one-class classification of data streams

Proceedings of the Third International Workshop on Knowledge Discovery from Sensor Data
Drift-Aware Ensemble Regression

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Concept Drifting Detection on Noisy Streaming Data in Random Ensemble Decision Trees

MLDM '09 Proceedings of the 6th International Conference on Machine Learning and Data Mining in Pattern Recognition
Harnessing the strengths of anytime algorithms for constant data streams

Data Mining and Knowledge Discovery
Combining Time and Space Similarity for Small Size Learning under Concept Drift

ISMIS '09 Proceedings of the 18th International Symposium on Foundations of Intelligent Systems
Adaptive Learning from Evolving Data Streams

IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
Ambiguous decision trees for mining concept-drifting data streams

Pattern Recognition Letters
Concept sampling: towards systematic selection in large-scale mixed concepts in machine learning

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Sequential genetic search for ensemble feature selection

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Learn++.NC: combining ensemble of classifiers with dynamically weighted consult-and-vote for efficient incremental learning of new classes

IEEE Transactions on Neural Networks
Tracking Recurring Concepts with Meta-learners

EPIA '09 Proceedings of the 14th Portuguese Conference on Artificial Intelligence: Progress in Artificial Intelligence
Improving Adaptive Bagging Methods for Evolving Data Streams

ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Mining Multi-label Concept-Drifting Data Streams Using Dynamic Classifier Ensemble

ACML '09 Proceedings of the 1st Asian Conference on Machine Learning: Advances in Machine Learning
Learning, detecting, understanding, and predicting concept changes

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Incremental learning in nonstationary environments with controlled forgetting

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
SERA: selectively recursive approach towards nonstationary imbalanced stream data mining

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams

Proceedings of the 2010 conference on Adaptive Stream Mining: Pattern Learning and Mining from Evolving Data Streams
A case-based technique for tracking concept drift in spam filtering

Knowledge-Based Systems
An ensemble approach for incremental learning in nonstationary environments

MCS'07 Proceedings of the 7th international conference on Multiple classifier systems
Mining distributed evolving data streams using fractal GP ensembles

EuroGP'07 Proceedings of the 10th European conference on Genetic programming
Quick adaptation to changing concepts by sensitive detection

IEA/AIE'07 Proceedings of the 20th international conference on Industrial, engineering, and other applications of applied intelligent systems
Incremental learning with multiple classifier systems using correction filters for classification

IDA'07 Proceedings of the 7th international conference on Intelligent data analysis
Unsupervised change analysis using supervised learning

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Mining multi-label concept-drifting data streams using ensemble classifiers

FSKD'09 Proceedings of the 6th international conference on Fuzzy systems and knowledge discovery - Volume 5
Data compression by volume prototypes for streaming data

Pattern Recognition
CALDS: context-aware learning from data streams

Proceedings of the First International Workshop on Novel Data Stream Pattern Mining Techniques
Drift Severity Metric

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Partial drift detection using a rule induction framework

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Tracking recurrent concepts using context

RSCTC'10 Proceedings of the 7th international conference on Rough sets and current trends in computing
Describing data with the support vector shell in distributed environments

ICDM'10 Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects
Adaptive methods for classification in arbitrarily imbalanced and drifting data streams

PAKDD'09 Proceedings of the 13th Pacific-Asia international conference on Knowledge discovery and data mining: new frontiers in applied data mining
On classifying drifting concepts in P2P networks

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Leveraging bagging for evolving data streams

ECML PKDD'10 Proceedings of the 2010 European conference on Machine learning and knowledge discovery in databases: Part I
Robust ensemble learning for mining noisy data streams

Decision Support Systems
Active learning from stream data using optimal weight classifier ensemble

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Building a new classifier in an ensemble using streaming unlabeled data

IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part II
Operative assessment of predicted generalization errors on non-stationary distributions in data-intensive applications

Intelligent Data Analysis
Learning recurring concepts from data streams with a context-aware ensemble

Proceedings of the 2011 ACM Symposium on Applied Computing
Incremental learning with multi-level adaptation

Neurocomputing
A robust incremental learning method for non-stationary environments

Neurocomputing
Precise anytime clustering of noisy sensor data with logarithmic complexity

Proceedings of the Fifth International Workshop on Knowledge Discovery from Sensor Data
Editorial: Classifying text streams by keywords using classifier ensemble

Data & Knowledge Engineering
Effective sentiment stream analysis with self-augmenting training and demand-driven projection

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Incremental Gaussian Discriminant Analysis based on Graybill and Deal weighted combination of estimators for brain tumour diagnosis

Journal of Biomedical Informatics
Random ensemble decision trees for learning concept-drifting data streams

PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part I
Enabling fast prediction for ensemble models on data streams

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Accuracy updated ensemble for data streams with concept drift

HAIS'11 Proceedings of the 6th international conference on Hybrid artificial intelligent systems - Volume Part II
A high-order feature synthesis and selection algorithm applied to insurance risk modelling

International Journal of Business Intelligence and Data Mining
Concurrent semi-supervised learning of data streams

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
A bounded version of online boosting on open-ended data streams

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Tracking concept change with incremental boosting by minimization of the evolving exponential loss

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I
Beating the baseline prediction in food sales: How intelligent an intelligent predictor is?

Expert Systems with Applications: An International Journal
Artificial recurrence for classification of streaming data with concept shift

ICAIS'11 Proceedings of the Second international conference on Adaptive and intelligent systems
Context-aware collaborative data stream mining in ubiquitous devices

IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Learning about the learning process

IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Knowledge maintenance on data streams with concept drifting

CIS'04 Proceedings of the First international conference on Computational and Information Science
Mining Recurring Concept Drifts with Limited Labeled Streaming Data

ACM Transactions on Intelligent Systems and Technology (TIST)
Evaluation of summarization schemes for learning in streams

PKDD'06 Proceedings of the 10th European conference on Principle and Practice of Knowledge Discovery in Databases
Adaptive classifier selection based on two level hypothesis tests for incremental learning

SSPR'06/SPR'06 Proceedings of the 2006 joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
Classifying noisy data streams

FSKD'06 Proceedings of the Third international conference on Fuzzy Systems and Knowledge Discovery
Orthogonal decision trees for resource-constrained physiological data stream monitoring using mobile devices

HiPC'05 Proceedings of the 12th international conference on High Performance Computing
An adaptive nearest neighbor classification algorithm for data streams

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Learning with local drift detection

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Improving the performance of data stream classifiers by mining recurring contexts

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
ACE: adaptive classifiers-ensemble system for concept-drifting environments

MCS'05 Proceedings of the 6th international conference on Multiple Classifier Systems
Temporal evolution and local patterns

LPD'04 Proceedings of the 2004 international conference on Local Pattern Detection
Fast perceptron decision tree learning from evolving data streams

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Handling different categories of concept drifts in data streams using distributed GP

EuroGP'10 Proceedings of the 13th European conference on Genetic Programming
Detecting change via competence model

ICCBR'10 Proceedings of the 18th international conference on Case-Based Reasoning Research and Development
Mining uncertain data streams using clustering feature decision trees

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
An instance-window based classification algorithm for handling gradual concept drifts

ADMI'11 Proceedings of the 7th international conference on Agents and Data Mining Interaction
Learning from concept drifting data streams with unlabeled data

Neurocomputing
A framework for application-driven classification of data streams

Neurocomputing
Very Fast Decision Rules for multi-class problems

Proceedings of the 27th Annual ACM Symposium on Applied Computing
Semi-supervised ensemble learning of data streams in the presence of concept drift

HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part II
Drift detection and model selection algorithms: concept and experimental evaluation

HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part II
Online linear and quadratic discriminant analysis with adaptive forgetting for streaming classification

Statistical Analysis and Data Mining
Learning decision rules from data streams

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
Kernel-based selective ensemble learning for streams of trees

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Two
2011 Special Issue: A just-in-time adaptive classification system based on the intersection of confidence intervals rule

Neural Networks
Learning very fast decision tree from uncertain data streams with positive and unlabeled samples

Information Sciences: an International Journal
A new fuzzy classifier for data streams

ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part I
A double-ensemble approach for classifying skewed data streams

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Heterogeneous ensemble for feature drifts in data streams

PAKDD'12 Proceedings of the 16th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part II
Nonlinear single layer neural network training algorithm for incremental, nonstationary and distributed learning scenarios

Pattern Recognition
Bayesian approach to the concept drift in the pattern recognition problems

MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
Class Specific Fuzzy Decision Trees for Mining High Speed Data Streams

Fundamenta Informaticae
Classifier Ensemble for Imbalanced Data Stream Classification

Proceedings of the CUBE International Information Technology Conference
A new method of mining data streams using harmony search

Journal of Intelligent Information Systems
New management operations on classifiers pool to track recurring concepts

DaWaK'12 Proceedings of the 14th international conference on Data Warehousing and Knowledge Discovery
Handling time changing data with adaptive very fast decision rules

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I
Next challenges for adaptive learning systems

ACM SIGKDD Explorations Newsletter
Batch-incremental versus instance-incremental learning in dynamic and evolving data

IDA'12 Proceedings of the 11th international conference on Advances in Intelligent Data Analysis
Extension of incremental linear discriminant analysis to online feature extraction under nonstationary environments

ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part II
On evaluating stream learning algorithms

Machine Learning
A data-mining approach to preference-based data ranking founded on contextual information

Information Systems
An ontology enhanced parallel SVM for scalable spam filter training

Neurocomputing
Coarse-to-fine multiclass learning and classification for time-critical domains

Pattern Recognition Letters
Dynamic multi-objective evolution of classifier ensembles for video face recognition

Applied Soft Computing
Stream-based event prediction using bayesian and bloom filters

Proceedings of the 4th ACM/SPEC International Conference on Performance Engineering
Efficient data stream classification via probabilistic adaptive windows

Proceedings of the 28th Annual ACM Symposium on Applied Computing
A hidden Markov model for collaborative filtering

MIS Quarterly
Real time processing of data from patient biodevices

HIKM '11 Proceedings of the Fourth Australasian Workshop on Health Informatics and Knowledge Management - Volume 120
An incremental learning algorithm based on the K-associated graph for non-stationary data classification

Information Sciences: an International Journal
Learning from data streams with only positive and unlabeled data

Journal of Intelligent Information Systems
A survey on concept drift adaptation

ACM Computing Surveys (CSUR)
Mining Data Streams with Skewed Distribution based on Ensemble Method

International Journal of Advanced Pervasive and Ubiquitous Computing
Ensemble of online neural networks for non-stationary and imbalanced data streams

Neurocomputing
A lossy counting based approach for learning on streams of graphs on a budget

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
A survey of multiple classifier systems as hybrid systems

Information Fusion
Combining block-based and online methods in learning ensembles from concept drifting data streams

Information Sciences: an International Journal
Concept drift detection via competence models

Artificial Intelligence
Tracking recurrent concepts using context

Intelligent Data Analysis - Combined Learning Methods and Mining Complex Data

Quantified Score

Hi-index	0.00

Visualization

Abstract

Ensemble methods have recently garnered a great deal of attention in the machine learning community. Techniques such as Boosting and Bagging have proven to be highly effective but require repeated resampling of the training data, making them inappropriate in a data mining context. The methods presented in this paper take advantage of plentiful data, building separate classifiers on sequential chunks of training points. These classifiers are combined into a fixed-size ensemble using a heuristic replacement strategy. The result is a fast algorithm for large-scale or streaming data that classifies as well as a single decision tree built on all the data, requires approximately constant memory, and adjusts quickly to concept drift.