Improving co-training with agreement-based sampling

  • Authors:
  • Jin Huang;Jelber Sayyad Shirabad;Stan Matwin;Jiang Su

  • Affiliations:
  • School of Information Technology and Engineering, University of Ottawa, Canada;School of Information Technology and Engineering, University of Ottawa, Canada;School of Information Technology and Engineering, University of Ottawa, Canada and Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland;School of Information Technology and Engineering, University of Ottawa, Canada

  • Venue:
  • RSCTC'10 Proceedings of the 7th international conference on Rough sets and current trends in computing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Co-training is an effective semi-supervised learning method which uses unlabeled instances to improve prediction accuracy. In the cotraining process, a random sampling is used to gradually select unlabeled instances to train classifiers. In this paper we explore whether other sampling methods can improve co-training performance. A novel selective sampling method, agreement-based sampling, is proposed. Experimental results show that our new sampling method can improve co-training significantly.