Best-match method used in co-training algorithm

  • Authors:
  • Hui Wang;Liping Ji;Wanli Zuo

  • Affiliations:
  • College of Computer Science and Technology, Jilin University, Key Laboratory of Symbolic Computation and Knowledge, Engineering of the Ministry of Education, Changchun, China;College of Computer Science and Technology, Jilin University, Key Laboratory of Symbolic Computation and Knowledge, Engineering of the Ministry of Education, Changchun, China;College of Computer Science and Technology, Jilin University, Key Laboratory of Symbolic Computation and Knowledge, Engineering of the Ministry of Education, Changchun, China

  • Venue:
  • PAKDD'07 Proceedings of the 2007 international conference on Emerging technologies in knowledge discovery and data mining
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Since 1998 there has been significant interest in supervised learning algorithms that combine labeled and unlabeled data for text learning tasks. The co-training algorithm applied to datasets which have a natural separation of their features into two disjoint sets. In this paper, we demonstrate that when learning from labeled and unlabeled data using co-training algorithm, selecting those document examples first which have two parts of best matching features can obtain a good performance.