Improving co-training with agreement-based sampling

Authors:
Jin Huang;Jelber Sayyad Shirabad;Stan Matwin;Jiang Su
Affiliations:
School of Information Technology and Engineering, University of Ottawa, Canada;School of Information Technology and Engineering, University of Ottawa, Canada;School of Information Technology and Engineering, University of Ottawa, Canada and Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland;School of Information Technology and Engineering, University of Ottawa, Canada
Venue:
RSCTC'10 Proceedings of the 7th international conference on Rough sets and current trends in computing
Year:
2010

Citing 9
Cited 0

Selective Sampling Using the Query by Committee Algorithm

Machine Learning
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Data mining: practical machine learning tools and techniques with Java implementations

Data mining: practical machine learning tools and techniques with Java implementations
Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
Analyzing the effectiveness and applicability of co-training

Proceedings of the ninth international conference on Information and knowledge management
Support Vector Machine Active Learning with Application sto Text Classification

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Selective Sampling with Redundant Views

Proceedings of the Seventeenth National Conference on Artificial Intelligence and Twelfth Conference on Innovative Applications of Artificial Intelligence
Learning from Labeled and Unlabeled Data using Graph Mincuts

ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
On multi-view active learning and the combination with semi-supervised learning

Proceedings of the 25th international conference on Machine learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

Co-training is an effective semi-supervised learning method which uses unlabeled instances to improve prediction accuracy. In the cotraining process, a random sampling is used to gradually select unlabeled instances to train classifiers. In this paper we explore whether other sampling methods can improve co-training performance. A novel selective sampling method, agreement-based sampling, is proposed. Experimental results show that our new sampling method can improve co-training significantly.