Two-view feature generation model for semi-supervised learning

  • Authors:
  • Rie Kubota Ando;Tong Zhang

  • Affiliations:
  • IBM T. J. Watson Research Center, Hawthorne, New York;Yahoo Inc., New York, New York

  • Venue:
  • Proceedings of the 24th international conference on Machine learning
  • Year:
  • 2007

Quantified Score

Hi-index 0.02

Visualization

Abstract

We consider a setting for discriminative semi-supervised learning where unlabeled data are used with a generative model to learn effective feature representations for discriminative training. Within this framework, we revisit the two-view feature generation model of co-training and prove that the optimum predictor can be expressed as a linear combination of a few features constructed from unlabeled data. From this analysis, we derive methods that employ two views but are very different from co-training. Experiments show that our approach is more robust than co-training and EM, under various data generation conditions.