Disagreement-Based Co-training

Authors:
Jafar Tanha;Maarten van Someren;Hamideh Afsarmanesh
Venue:
ICTAI '11 Proceedings of the 2011 IEEE 23rd International Conference on Tools with Artificial Intelligence
Year:
2011

Abstract

Semi-supervised learning algorithms such as co-training are now used in many domains. In co-training, two classifiers, based on different subsets of the features or on different learning algorithms, are trained in parallel. Unlabeled data that the two classifiers label differently, but for which one classifier has high confidence, are labeled and used as training data for the other classifier. In this paper, we propose a new form of co-training, called Ensemble-Co-Training, that uses an ensemble of different learning algorithms. Based on a theorem by Angluin and Laird that relates noise in the data to the error of hypotheses learned from these data, we propose a criterion for selecting a subset of high-confidence predictions and for estimating the error rate of a classifier in each iteration of the training process. Experiments show that the new method gives better results than state-of-the-art methods in almost all domains.
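
The disagreement-based labeling step described in the abstract can be sketched roughly as follows. This is an illustration only, not the authors' Ensemble-Co-Training: it assumes two toy nearest-centroid learners, each seeing one feature, instead of an ensemble of learning algorithms, and it uses a fixed confidence threshold as a stand-in for the Angluin-Laird-based criterion. The names `CentroidClassifier`, `co_train`, `threshold`, and `rounds` are hypothetical.

```python
from collections import defaultdict


class CentroidClassifier:
    """Toy 1-D nearest-centroid learner (hypothetical stand-in for any classifier)."""

    def __init__(self, view):
        self.view = view        # index of the single feature this learner sees
        self.centroids = {}

    def fit(self, X, y):
        sums, counts = defaultdict(float), defaultdict(int)
        for x, label in zip(X, y):
            sums[label] += x[self.view]
            counts[label] += 1
        self.centroids = {c: sums[c] / counts[c] for c in sums}
        return self

    def predict_with_confidence(self, x):
        # Assumed confidence measure: normalized inverse distance to the centroids.
        weights = {c: 1.0 / (abs(x[self.view] - m) + 1e-9)
                   for c, m in self.centroids.items()}
        label = max(weights, key=weights.get)
        return label, weights[label] / sum(weights.values())


def co_train(X_lab, y_lab, X_unlab, threshold=0.8, rounds=5):
    """Two learners label for each other where they disagree and one is confident."""
    # Each learner keeps its own growing training set.
    train = [(list(X_lab), list(y_lab)), (list(X_lab), list(y_lab))]
    clfs = [CentroidClassifier(0), CentroidClassifier(1)]
    pool = list(X_unlab)
    for _ in range(rounds):
        for clf, (X, y) in zip(clfs, train):
            clf.fit(X, y)
        remaining = []
        for x in pool:
            (l0, c0), (l1, c1) = [clf.predict_with_confidence(x) for clf in clfs]
            if l0 != l1 and c0 >= threshold and c0 > c1:
                # Learner 0 is confident: its label becomes training data for learner 1.
                train[1][0].append(x)
                train[1][1].append(l0)
            elif l0 != l1 and c1 >= threshold and c1 > c0:
                # Learner 1 is confident: its label becomes training data for learner 0.
                train[0][0].append(x)
                train[0][1].append(l1)
            else:
                remaining.append(x)
        if len(remaining) == len(pool):
            break               # nothing new was labeled in this round
        pool = remaining
    for clf, (X, y) in zip(clfs, train):
        clf.fit(X, y)           # final refit on the enlarged training sets
    return clfs, train
```

Calling `co_train` with a few labeled points and a pool of unlabeled points returns the two fitted learners together with their enlarged training sets. In the paper's setting, the fixed `threshold` would be replaced by the noise-based criterion the abstract mentions.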