Semi-supervised learning with explicit misclassification modeling

  • Authors:
  • Massih-Reza Amini;Patrick Gallinari

  • Affiliations:
  • University of Pierre and Marie Curie, Computer Science Laboratory of Paris 6, Paris, France;University of Pierre and Marie Curie, Computer Science Laboratory of Paris 6, Paris, France

  • Venue:
  • IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper investigates a new approach for training discriminant classifiers when only a small set of labeled data is available together with a large set of unlabeled data. This algorithm optimizes the classification maximum likelihood of a set of labeled-unlabeled data, using a variant form of the Classification Expectation Maximization (CEM) algorithm. Its originality is that it makes use of both unlabeled data and of a probabilistic misclassification model for these data. The parameters of the label-error model are learned together with the classifier parameters. We demonstrate the effectiveness of the approach on four data-sets and show the advantages of this method over a previously developed semi-supervised algorithm which does not consider imperfections in the labeling process.