Combining committee-based semi-supervised and active learning and its application to handwritten digits recognition

  • Authors:
  • Mohamed Farouk Abdel Hady;Friedhelm Schwenker

  • Affiliations:
  • Institute of Neural Information Processing, University of Ulm, Ulm, Germany;Institute of Neural Information Processing, University of Ulm, Ulm, Germany

  • Venue:
  • MCS'10: Proceedings of the 9th International Conference on Multiple Classifier Systems
  • Year:
  • 2010

Abstract

Semi-supervised learning reduces the cost of labeling training data for a supervised learning algorithm by using unlabeled data together with labeled data to improve performance. Co-Training is a popular semi-supervised learning algorithm that requires multiple redundant and independent sets of features (views). In many real-world application domains, this requirement cannot be satisfied. In this paper, a single-view variant of Co-Training, CoBC (Co-Training by Committee), is proposed, which requires an ensemble of diverse classifiers instead of redundant and independent views. We then introduce two new learning algorithms, QBC-then-CoBC and QBC-with-CoBC, which combine the merits of committee-based semi-supervised learning and committee-based active learning. An empirical study on handwritten digit recognition is conducted in which the random subspace method (RSM) is used to create ensembles of diverse C4.5 decision trees. Experiments show that these two combinations outperform the other, non-committee-based combinations.
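
The abstract does not give the selection rules used by CoBC or by the two combined algorithms, so the following is only a minimal sketch of the general idea under stated assumptions. It assumes that committee disagreement is measured by vote entropy, that the most contested unlabeled examples are sent to an annotator (the query-by-committee side), and that the least contested ones receive the committee's majority-vote label (the semi-supervised side). The function names `random_subspaces`, `vote_entropy`, and `split_unlabeled` are hypothetical and do not come from the paper; training of the C4.5 trees themselves is omitted.

```python
import math
import random
from collections import Counter


def random_subspaces(n_features, subspace_size, n_members, seed=0):
    """Draw one random feature subset per committee member.

    This is the sampling step of the random subspace method: each member
    would later be trained only on its own subset of feature indices.
    """
    rng = random.Random(seed)
    return [sorted(rng.sample(range(n_features), subspace_size))
            for _ in range(n_members)]


def vote_entropy(votes):
    """Entropy of the committee's label votes for one example.

    Zero means all members agree; larger values mean more disagreement.
    """
    counts = Counter(votes)
    total = len(votes)
    return -sum((c / total) * math.log(c / total) for c in counts.values())


def split_unlabeled(committee_votes, n_query, n_pseudo):
    """Split unlabeled examples by committee disagreement (assumed criterion).

    committee_votes maps an example id to the list of labels predicted by
    the committee members. Returns (query_ids, pseudo_labels): the most
    contested examples are candidates for human labeling, the least
    contested ones are given the majority-vote label.
    """
    ranked = sorted(committee_votes,
                    key=lambda i: vote_entropy(committee_votes[i]))
    pseudo_ids = ranked[:n_pseudo]
    query_ids = [i for i in reversed(ranked) if i not in pseudo_ids][:n_query]
    pseudo_labels = {i: Counter(committee_votes[i]).most_common(1)[0][0]
                     for i in pseudo_ids}
    return query_ids, pseudo_labels


# Hypothetical votes of a four-member committee on four unlabeled digits.
votes = {
    0: [3, 3, 3, 3],
    1: [3, 8, 3, 8],
    2: [1, 1, 1, 7],
    3: [5, 5, 5, 5],
}
query_ids, pseudo_labels = split_unlabeled(votes, n_query=1, n_pseudo=2)
subspaces = random_subspaces(n_features=64, subspace_size=16, n_members=4)
```

In this toy input, example 1 splits the committee evenly and is the one selected for querying, while examples 0 and 3 are unanimous and are pseudo-labeled. Whether the two steps run in sequence or interleaved within each iteration is what would distinguish a "then" from a "with" style of combination; the sketch does not model that scheduling.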