Fundamentals of speech recognition
Fundamentals of speech recognition
Learning from labeled and unlabeled data on a directed graph
ICML '05 Proceedings of the 22nd international conference on Machine learning
Classification in Networked Data: A Toolkit and a Univariate Case Study
The Journal of Machine Learning Research
Efficient and simple generation of random simple connected graphs with prescribed degree sequence
COCOON'05 Proceedings of the 11th annual international conference on Computing and Combinatorics
Within-Network Classification Using Local Structure Similarity
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
A nonparametric classification method based on K-associated graphs
Information Sciences: an International Journal
Expert Systems with Applications: An International Journal
Review of statistical network analysis: models, algorithms, and software
Statistical Analysis and Data Mining
A link-analysis-based discriminant analysis for exploring partially labeled graphs
Pattern Recognition Letters
Data Mining and Knowledge Discovery
Hi-index | 0.00 |
This paper describes a novel technique, called $\mathcal{D}$-walks, to tackle semi-supervised classification problems in large graphs. We introduce here a betweenness measure based on passage times during random walks of bounded lengths. Such walks are further constrained to start and end in nodes within the same class, defining a distinct betweenness for each class. Unlabeled nodes are classified according to the class showing the highest betweenness. Forward and backward recurrences are derived to efficiently compute the passage times. $\mathcal{D}$-walks can deal with directed or undirected graphs with a linear time complexity with respect to the number of edges, the maximum walk length considered and the number of classes. Experiments on various real-life databases show that $\mathcal{D}$-walks outperforms NetKit [5], the approach of Zhou and Schölkopf [15] and the regularized laplacian kernel [2]. The benefit of $\mathcal{D}$-walks is particularly noticeable when few labeled nodes are available. The computation time of $\mathcal{D}$-walks is also substantially lower in all cases.