Active learning from stream data using optimal weight classifier ensemble

Authors:
Xingquan Zhu;Peng Zhang;Xiaodong Lin;Yong Shi
Affiliations:
Department of Computer Science and Engineering, Florida Atlantic University, Boca Raton, FL and QCIS Center, Faculty of Engineering and Information Technology, University of Technology, Sydney, NS ...;Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;Department of Management Science and Information Systems, Rutgers Business School, Rutgers, the State University of New Jersey, Newark, NJ;College of Information Science and Technology, University of Nebraska at Omaha, NE and Fictitious Economy and Data Science Research Center, Chinese Academy of Sciences, Beijing, China
Venue:
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Year:
2010

Abstract

In this paper, we propose a new research problem: active learning from data streams, where data volumes grow continuously and labeling all data is expensive and impractical. The objective is to label a small portion of the stream data and derive from it a model that predicts future instances as accurately as possible. To tackle the technical challenges raised by the dynamic nature of stream data, i.e., increasing data volumes and evolving decision concepts, we propose a classifier-ensemble-based active learning framework that selectively labels instances from data streams to build a classifier ensemble. We argue that a classifier ensemble's variance directly corresponds to its error rate, so reducing the ensemble's variance is equivalent to improving its prediction accuracy. Consequently, instances should be labeled so as to minimize the variance of the underlying classifier ensemble. Accordingly, we introduce a minimum-variance (MV) principle to guide the instance labeling process for data streams. In addition, we derive an optimal-weight calculation method to determine the weight values for the classifier ensemble. The MV principle and the optimal weighting module are combined to build an active learning framework for data streams. Experimental results on synthetic and real-world data demonstrate the performance of the proposed work in comparison with other approaches.
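The abstract does not give the variance formula or the optimal-weight derivation, so the following is only a minimal sketch of how variance-guided labeling might look, not the authors' method. It assumes that each ensemble member outputs a class-probability vector, that the ensemble variance on an instance is the weighted squared deviation of member outputs from the weighted ensemble mean, and that the instances with the highest such variance are the ones worth labeling. The function names `ensemble_variance` and `select_for_labeling`, and the externally supplied `weights`, are hypothetical.

```python
from typing import List, Sequence


def ensemble_variance(member_probs: Sequence[Sequence[float]],
                      weights: Sequence[float]) -> float:
    """Weighted disagreement of ensemble members on a single instance.

    member_probs[k][c] is member k's probability estimate for class c.
    ASSUMPTION: variance is measured as the weighted mean squared deviation
    of each member's vector from the weighted ensemble mean.
    """
    total = sum(weights)
    norm = [w / total for w in weights]  # normalize weights to sum to 1
    n_classes = len(member_probs[0])
    # Weighted ensemble mean prediction for each class.
    mean = [sum(w * p[c] for w, p in zip(norm, member_probs))
            for c in range(n_classes)]
    # Weighted squared distance of each member from that mean.
    return sum(
        w * sum((p[c] - mean[c]) ** 2 for c in range(n_classes))
        for w, p in zip(norm, member_probs)
    )


def select_for_labeling(pool_probs: Sequence[Sequence[Sequence[float]]],
                        weights: Sequence[float],
                        budget: int) -> List[int]:
    """Return indices of the `budget` unlabeled instances with highest variance.

    ASSUMPTION: labeling the most-disputed instances is used here as a proxy
    for reducing the ensemble's variance.
    """
    scores = [ensemble_variance(p, weights) for p in pool_probs]
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return ranked[:budget]


# Three unlabeled instances, three ensemble members, two classes.
pool = [
    [[0.9, 0.1], [0.8, 0.2], [0.85, 0.15]],  # members largely agree
    [[0.9, 0.1], [0.1, 0.9], [0.5, 0.5]],    # members strongly disagree
    [[0.6, 0.4], [0.4, 0.6], [0.5, 0.5]],    # members mildly disagree
]
weights = [1.0, 1.0, 1.0]  # placeholder equal weights, not the paper's optimal weights
print(select_for_labeling(pool, weights, budget=2))  # [1, 2]
```

In a stream setting, a routine like this would presumably be applied chunk by chunk, with `weights` recomputed as the decision concept evolves; the equal weights above merely stand in for the optimal weighting module mentioned in the abstract.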