Combining labeled and unlabeled data with co-training
COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Semi-supervised support vector machines
Proceedings of the 1998 conference on Advances in neural information processing systems II
Transductive Inference for Text Classification using Support Vector Machines
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Identifying and Handling Mislabelled Instances
Journal of Intelligent Information Systems
Unsupervised word sense disambiguation rivaling supervised methods
ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
On convergence properties of the em algorithm for gaussian mixtures
Neural Computation
Optimization Techniques for Semi-Supervised Support Vector Machines
The Journal of Machine Learning Research
SemiBoost: Boosting for Semi-Supervised Learning
IEEE Transactions on Pattern Analysis and Machine Intelligence
Semi-Supervised Learning
SETRED: self-training with editing
PAKDD'05 Proceedings of the 9th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining
CoTrade: Confident Co-Training With Data Editing
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Hi-index | 0.00 |
In this paper, a novel inductive support vector machine for semi-supervised learning, named IS3VM, is proposed, which aims to improve SVM by bootstrapping unlabeled data with self-training. The SVM classifier is iteratively refined through the augmentation of the training set. An improved self-training method is given by employing neighborhood graph for guarantying the reliability of newly added training examples. In detail, in each iteration of the self-training process, the local cut edge weight statistic is used to help estimate whether a newly labeled example is reliable or not, and only the reliable self-labeled examples are used to enlarge the labeled training set. Experiments show that, the improved self-training is beneficial and the proposed IS3VM algorithm can effectively exploit unlabeled data to achieve better performance, and is comparable to the-state-of-the-art semi-supervised SVM.