A purity measure based transductive learning algorithm

Authors:
João Roberto Bertini Junior;Liang Zhao
Affiliations:
Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo, São Carlos, SP, Brasil;Instituto de Ciências Matemáticas e de Computação, Universidade de São Paulo, São Carlos, SP, Brasil
Venue:
ISNN'13 Proceedings of the 10th international conference on Advances in Neural Networks - Volume Part II
Year:
2013

Citing 5
Cited 0

Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
Text Classification from Labeled and Unlabeled Documents using EM

Machine Learning - Special issue on information retrieval
Graph-Based Semisupervised Learning

IEEE Transactions on Pattern Analysis and Machine Intelligence
Introduction to Algorithms, Third Edition

Introduction to Algorithms, Third Edition
A nonparametric classification method based on K-associated graphs

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

The increasing on the human ability to gather data has led to an increasing effort on labeling them to be used in specific applications such as classification and regression. Therefore, automatic labeling methods such as semi-supervised transdutive learning algorithms are of a major concern on the machine learning and data mining community nowadays. This paper proposes a graph-based algorithm which uses the purity measure to help spreading the labels throughout the graph. The purity measure determines how intertwined are different subspaces of data regarding its classes. As high values of purity indicate low mixture among patterns of different classes, its maximization helps finding well-separated connected subgraphs; which facilitates the label spreading process. Results on benchmark data sets comparing to state-of-the-art methods show the potential of the proposed algorithm.