Larger is better: seed selection in link-based anti-spamming algorithms
Proceedings of the 17th international conference on World Wide Web
Exploiting bidirectional links: making spamming detection easier
Proceedings of the 18th ACM conference on Information and knowledge management
Foundations and Trends in Information Retrieval
Reliability prediction of webpages in the medical domain
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Detecting Fake Medical Web Sites Using Recursive Trust Labeling
ACM Transactions on Information Systems (TOIS)
Combating Web spam through trust-distrust propagation with confidence
Pattern Recognition Letters
Hi-index | 0.00 |
Search engines are playing a more and more important role in discovering information on the web nowadays. Spam web pages, however, are employing various tricks to bamboozle search engines, therefore achieving undeserved ranks. In this paper, we propose a new page importance metric, which takes both the content quality and the link quality into consideration. Based on this metric, we can judge the trust scores of all the web pages using the web link graph. Experimental results running on over 15 million web pages show that our method can filter out spam and identify reputable sites effectively.