Selective Sampling Based on the Variation in Label Assignments

Authors:
Piotr Juszczak;Robert P. W. Duin
Affiliations:
Delft University of Technology, The Netherlands;Delft University of Technology, The Netherlands
Venue:
ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
Year:
2004

Citing 0
Cited 3

Open Set Face Recognition Using Transduction

IEEE Transactions on Pattern Analysis and Machine Intelligence
Informative sampling for large unbalanced data sets

Proceedings of the 10th annual conference companion on Genetic and evolutionary computation
Coevolution of simulator proxies and sampling strategies for petroleum reservoir modeling

CEC'09 Proceedings of the Eleventh conference on Congress on Evolutionary Computation

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, a new selective sampling method for the active learning framework is presented.Initially, a small training set T and a large unlabeled set 驴 are given.The goal is to select, one by one, the most informative objects from 驴 such that, after labeling by an expert, they will guarantee the best improvement in the classifier performance. Our sampling strategy relies on measuring the variation in label assignments (of the unlabeled set) between the classifier trained on T and the classifiers trained on T with a single unlabeled object added with all possible labels. We compare the performance of our algorithm with two traditional procedures random sampling and uncertainty sampling. We show empirically across a range of datasets that the proposed selective sampling method decreases the number of labeled instances needed to achieve the desired error for the fixed size of T.Experimental results on toy problems and the UCI datasets are presented.