Authoritative sources in a hyperlinked environment
Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Tri-Training: Exploiting Unlabeled Data Using Three Classifiers
IEEE Transactions on Knowledge and Data Engineering
Link based small sample learning for web spam detection
Proceedings of the 18th international conference on World wide web
Hi-index | 0.00 |
In web spam detection, we propose a new semi-supervised learning algorithm named HFSSL (harmonic functions based semi-supervised learning). In our method, labeled and unlabeled web pages are represented as vertices in a weighted graph. The learning problem is then modeled as a Gaussian random field on this graph, where the mean of the field is characterized by harmonic functions, which can be efficiently obtained using matrix methods. The experiments on standard WEBSPAM-UK2006 show that our algorithm is effective.