Semi-supervised learning with an imperfect supervisor

  • Authors:
  • R. Amini;Patrick Gallinari

  • Affiliations:
  • University Pierre and Marie Curie, Department of Computer Science, 8 rue du capitaine Scott, 75015, Paris, France;University Pierre and Marie Curie, Department of Computer Science, 8 rue du capitaine Scott, 75015, Paris, France

  • Venue:
  • Knowledge and Information Systems
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Real-life applications may involve huge data sets with misclassified or partially classified training data. Semi-supervised learning and learning in the presence of label noise have recently emerged as new paradigms in the machine learning community to cope with this kind of problems. This paper describes a new discriminant algorithm for semi-supervised learning. This algorithm optimizes the classification maximum likelihood (CML) of a set of labeled–unlabeled data, using a discriminant extension of the Classification Expectation Maximization algorithm. We further propose to extend this algorithm by modeling imperfections in the estimated class labels for unlabeled data. The parameters of this label-error model are learned together with the semi-supervised classifier parameters. We demonstrate the effectiveness of the approach using extensive experiments on different datasets.