Active learning for class probability estimation and ranking

Authors:
Maytal Saar-Tsechansky;Foster Provost
Affiliations:
Department of Information Systems, Leonard N. Stern School of Business, New York University;Department of Information Systems, Leonard N. Stern School of Business, New York University
Venue:
IJCAI'01 Proceedings of the 17th international joint conference on Artificial intelligence - Volume 2
Year:
2001

Citing 15
Cited 22

A theory of the learnable

Communications of the ACM
Query by committee

COLT '92 Proceedings of the fifth annual workshop on Computational learning theory
C4.5: programs for machine learning

C4.5: programs for machine learning
A sequential algorithm for training text classifiers

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Improving Generalization with Active Learning

Machine Learning - Special issue on structured connectionist systems
A simple, fast, and effective rule learner

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Active learning using adaptive resampling

Proceedings of the sixth ACM SIGKDD international conference on Knowledge discovery and data mining
An Empirical Comparison of Voting Classification Algorithms: Bagging, Boosting, and Variants

Machine Learning
Queries and Concept Learning

Machine Learning
Queries and Concept Learning

Machine Learning
The Case against Accuracy Estimation for Comparing Induction Algorithms

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Query Learning Strategies Using Boosting and Bagging

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Learning and Making Decisions When Costs and Probabilities are Both Unknown

Learning and Making Decisions When Costs and Probabilities are Both Unknown
Active learning with statistical models

Journal of Artificial Intelligence Research
The use of the area under the ROC curve in the evaluation of machine learning algorithms

Pattern Recognition

Class Probability Estimation and Cost-Sensitive Classification Decisions

ECML '02 Proceedings of the 13th European Conference on Machine Learning
Active Sampling for Feature Selection

ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
On the Use of Optimization for Data Mining: Theoretical Interactions and eCRM Opportunities

Management Science
Diverse ensembles for active learning

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Efficiently exploring architectural design spaces via predictive modeling

Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
Efficient sampling of training set in large and noisy multimedia data

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Selectively Acquiring Customer Information: A New Data Acquisition Problem and an Active Learning-Based Solution

Management Science
Active learning for logistic regression: an evaluation

Machine Learning
Efficient architectural design space exploration via predictive modeling

ACM Transactions on Architecture and Code Optimization (TACO)
Customer targeting models using actively-selected web content

Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Dual Strategy Active Learning

ECML '07 Proceedings of the 18th European conference on Machine Learning
Mining in Large Noisy Domains

Journal of Data and Information Quality (JDIQ)
Learning when training data are costly: the effect of class distribution on tree induction

Journal of Artificial Intelligence Research
Active cost-sensitive learning

IJCAI'05 Proceedings of the 19th international joint conference on Artificial intelligence
Accurate and efficient processor performance prediction via regression tree based modeling

Journal of Systems Architecture: the EUROMICRO Journal
Cost-minimising strategies for data labelling: optimal stopping and active learning

FoIKS'08 Proceedings of the 5th international conference on Foundations of information and knowledge systems
Architecture performance prediction using evolutionary artificial neural networks

Evo'08 Proceedings of the 2008 conference on Applications of evolutionary computing
Active sampling for knowledge discovery from biomedical data

PKDD'05 Proceedings of the 9th European conference on Principles and Practice of Knowledge Discovery in Databases
Active learning for probability estimation using jensen-shannon divergence

ECML'05 Proceedings of the 16th European conference on Machine Learning
Active learning for probabilistic neural networks

ICNC'05 Proceedings of the First international conference on Advances in Natural Computation - Volume Part I
Musical sound recognition by active learning PNN

MRCS'06 Proceedings of the 2006 international conference on Multimedia Content Representation, Classification and Security
Model guided adaptive design and analysis in computer experiment

Statistical Analysis and Data Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

For many supervised learning tasks it is very costly to produce training data with class labels. Active learning acquires data incrementally, at each stage using the model learned so far to help identify especially useful additional data for labeling. Existing empirical active learning approaches have focused on learning classifiers. However, many applications require estimations of the probability of class membership, or scores that can be used to rank new cases. We present a new active learning method for class probability estimation (CPE) and ranking. BOOTSTRAP-LV selects new data for labeling based on the variance in probability estimates, as determined by learning multiple models from bootstrap samples of the existing labeled data. We show empirically that the method reduces the number of data items that must be labeled, across a wide variety of data sets. We also compare BOOTSTRAP-LV with UNCERTAINTY SAMPLING, an existing active learning method designed to maximize classification accuracy. The results show that BOOTSTRAP-LV dominates for CPE. Surprisingly it also often is preferable for accelerating simple accuracy maximization.