FindCredPg: a novel method to find credible pages based on trust web graph

Authors:
Teng Wang;Qing Zhu;Shan Wang;JingFan Liang
Affiliations:
Key Laboratory of the Ministry of Education for Data Engineering and Knowledge Engineering, Renmin University of China, Beijing, China and School of Information, Renmin University of China, Beijin ...;Key Laboratory of the Ministry of Education for Data Engineering and Knowledge Engineering, Renmin University of China, Beijing, China and School of Information, Renmin University of China, Beijin ...;Key Laboratory of the Ministry of Education for Data Engineering and Knowledge Engineering, Renmin University of China, Beijing, China and School of Information, Renmin University of China, Beijin ...;Key Laboratory of the Ministry of Education for Data Engineering and Knowledge Engineering, Renmin University of China, Beijing, China and School of Information, Renmin University of China, Beijin ...
Venue:
APWeb'12 Proceedings of the 14th Asia-Pacific international conference on Web Technologies and Applications
Year:
2012

Citing 10
Cited 0

What makes Web sites credible?: a report on a large quantitative study

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
A method of rating the credibility of news documents on the web

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Countering web spam with credibility-based link analysis

Proceedings of the twenty-sixth annual ACM symposium on Principles of distributed computing
Combating web spam with trustrank

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Tracking Web spam with HTML style similarities

ACM Transactions on the Web (TWEB)
Web-based evidence excavation to explore the authenticity of local events

Proceedings of the 2nd ACM workshop on Information credibility on the web
Honto? search: estimating trustworthiness of web information by search results aggregation and temporal analysis

APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Trust in wikipedia: how users trust information from an unknown source

Proceedings of the 4th workshop on Information credibility
Augmenting web pages and search results to support credibility assessment

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
T-verifier: Verifying truthfulness of fact statements

ICDE '11 Proceedings of the 2011 IEEE 27th International Conference on Data Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Finding credible pages is a challenging problem on the Web. Our key observation in this paper is that credible pages usually link to credible content-related pages, which is different from a normal page usually links to normal pages in spam page detection. We propose a novel method to find credible pages based on the trust web graph we define. This method first measures the content correlation between pages connected by hyperlinks, then it combines web link structure with content correlation value of pages to build a trust web graph. At last, credible pages are found successfully by using trust relation of vertices on the trust web graph. We construct a real-world data set by crawling millions of pages on the web and run a set of experiments on this data set. Experiment results show that the accuracy of this method is near 80% and the efficiency is higher.