The anatomy of a large-scale hypertextual Web search engine
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Proceedings of the 9th international World Wide Web conference on Computer networks : the international journal of computer and telecommunications netowrking
Perception of content, structure, and presentation changes in Web-based hypertext
Proceedings of the 12th ACM conference on Hypertext and Hypermedia
ChangeDetector: a site-level monitoring tool for the WWW
Proceedings of the 11th international conference on World Wide Web
Continual Queries for Internet Scale Event-Driven Information Delivery
IEEE Transactions on Knowledge and Data Engineering
The Evolution of the Web and Implications for an Incremental Crawler
VLDB '00 Proceedings of the 26th International Conference on Very Large Data Bases
Web Structure, Dynamics and Page Quality
SPIRE 2002 Proceedings of the 9th International Symposium on String Processing and Information Retrieval
Trend detection through temporal link analysis
Journal of the American Society for Information Science and Technology - Special issue: Webometrics
Temporal ranking for fresh information retrieval
AsianIR '03 Proceedings of the sixth international workshop on Information retrieval with Asian languages - Volume 11
Effective change detection using sampling
VLDB '02 Proceedings of the 28th international conference on Very Large Data Bases
Exploiting time-based synonyms in searching document archives
Proceedings of the 10th annual joint conference on Digital libraries
Determining time of queries for re-ranking search results
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
A web search method based on the temporal relation of query keywords
WISE'06 Proceedings of the 7th international conference on Web Information Systems
Learning to rank search results for time-sensitive queries
Proceedings of the 21st ACM international conference on Information and knowledge management
Hi-index | 0.00 |
Existing search engines contain the picture of the Web from the past and their ranking algorithms are based on data crawled some time ago. However, a user requires not only relevant but also fresh information. We have developed a method for adjusting the ranking of search engine results from the point of view of page freshness and relevance. It uses an algorithm that post-processes search engine results based on the changed contents of the pages. By analyzing archived versions of web pages we estimate temporal qualities of pages, that is, general freshness and relevance of the page to the query topic over certain time frames. For the top quality web pages, their content differences between past snapshots of the pages indexed by a search engine and their present versions are analyzed. Basing on these differences the algorithm assigns new ranks to the web pages without the need to maintain a constantly updated index of web documents.