The notion of data and its quality dimensions
Information Processing and Management: an International Journal
Toward quality data: an attribute-based approach
Decision Support Systems - Special issue on information technologies and systems
The nature of statistical learning theory
The nature of statistical learning theory
Communications of the ACM
The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Authoritative sources in a hyperlinked environment
Proceedings of the ninth annual ACM-SIAM symposium on Discrete algorithms
Real life, real users, and real needs: a study and analysis of user queries on the web
Information Processing and Management: an International Journal
Does “authority” mean quality? predicting expert quality ratings of Web documents
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Communications of the ACM - Supporting community and building social capital
Web Wisdom; How to Evaluate and Create Information Quality on the Webb
Web Wisdom; How to Evaluate and Create Information Quality on the Webb
Web Site Optimization Using Page Popularity
IEEE Internet Computing
Data-rich Section Extraction from HTML pages
WISE '02 Proceedings of the 3rd International Conference on Web Information Systems Engineering
Web Structure, Dynamics and Page Quality
SPIRE 2002 Proceedings of the 9th International Symposium on String Processing and Information Retrieval
Genre based Navigation on the Web
HICSS '01 Proceedings of the 34th Annual Hawaii International Conference on System Sciences ( HICSS-34)-Volume 4 - Volume 4
Reproduced and emergent genres of communication on the World-Wide Web
HICSS '97 Proceedings of the 30th Hawaii International Conference on System Sciences: Digital Documents - Volume 6
Impact of search engines on page popularity
Proceedings of the 13th international conference on World Wide Web
Learning block importance models for web pages
Proceedings of the 13th international conference on World Wide Web
User Expectations and Rankings of Quality Factors in Different Web Site Domains
International Journal of Electronic Commerce
A "quick and dirty" website data quality indicator
Proceedings of the 2nd ACM workshop on Information credibility on the web
Informing observers: quality-driven filtering and composition of web 2.0 sources
Proceedings of the 2012 Joint EDBT/ICDT Workshops
Hi-index | 0.00 |
Currently, search engines rank search results using mainly link-based metrics. While usually most of the search results are relevant to a user's query, due to how the results are ranked, users often are still not totally satisfied with them. Using a proposed framework of web data quality, it is found that current search engines usually only consider a very small number of the dimensions of web data quality in their ranking algorithms. In this paper, a newly identified web data-quality dimension, appropriateness, which is based on the linguistic and visual complexity of a web page, is studied. It is computed using new metrics that classify web pages into three main appropriateness genres: scholarly, news/general interest and popular. Experiments have shown the effectiveness of the metrics in ranking web pages by whether they are appropriate to a user's task and information needs.