A statistical model for scientific readability
Proceedings of the tenth international conference on Information and knowledge management
Bilingual web page and site readability assessment
Proceedings of the 15th international conference on World Wide Web
SlideSeer: a digital library of aligned document and presentation pairs
Proceedings of the 7th ACM/IEEE-CS joint conference on Digital libraries
Combating web spam with trustrank
VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Predicting the readability of short web summaries
Proceedings of the Second ACM International Conference on Web Search and Data Mining
Cognitively motivated features for readability assessment
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Revisiting readability: a unified framework for predicting text quality
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Easiest-first search: towards comprehension-based web search
Proceedings of the 18th ACM conference on Information and knowledge management
Adaptive ranking of search results by considering user's comprehension
Proceedings of the 4th International Conference on Uniquitous Information Management and Communication
Estimating content concreteness for finding comprehensible documents
Proceedings of the sixth ACM international conference on Web search and data mining
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Hi-index | 0.00 |
We put forward a hypothesis that if there is a link from one page to another, it is likely that comprehensibility of the two pages is similar. To investigate whether this hypothesis is true or not, we conduct experiments using existing readability measures. We investigate the relationship between links and readability of text extracted from web pages for two datasets, set of English and Japanese pages. We could find that links and readability of text extracted from web pages are correlated. Based on the hypothesis, we propose a link analysis algorithm to measure comprehensibility of web pages. Our method is based on the Trust Rank algorithm which is originally used for combating web spam. We use link structure to propagate readability scores from source pages selected based on their comprehensibility. The results of experimental evaluation demonstrate that our method could improve estimation of comprehensibility of pages.