Traditional web search engines find it challenging to achieve good search quality for recency-sensitive queries, as they are prone to delays in discovering, indexing and ranking new web pages. In this paper we introduce PreGen, an adaptive preview generation system that runs as part of a web search engine to improve search result quality for recency-sensitive queries. PreGen uses a machine learning algorithm to classify and select live web feeds, and generates "previews" of new web pages based on the link descriptions available in these feeds. The search engine can then index and present relevant page previews as part of its search results before the pages are fetched from the web, thereby reducing end-to-end delays. Our experiments show that PreGen improves the search relevance of a state-of-the-art search engine for recency-sensitive queries by 3% and reduces the average latency of affected documents by 50%.
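The preview idea can be illustrated with a minimal sketch. It is not PreGen's implementation: it assumes RSS 2.0 feeds, and the names `Preview`, `previews_from_rss` and `SAMPLE_FEED` are hypothetical. It only shows how a link description in a feed could become an indexable record before the linked page is fetched. The machine-learned feed classification and selection step is not specified in the abstract and is left out.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import List


@dataclass
class Preview:
    """Hypothetical stand-in record for a page that has not been fetched yet."""
    url: str
    title: str
    summary: str


def previews_from_rss(xml_text: str) -> List[Preview]:
    """Build one preview per <item> in an RSS 2.0 document (assumed format)."""
    root = ET.fromstring(xml_text)
    previews = []
    for item in root.iter("item"):
        url = (item.findtext("link") or "").strip()
        if not url:
            # Without a target URL there is no page to preview.
            continue
        title = (item.findtext("title") or "").strip()
        summary = (item.findtext("description") or "").strip()
        previews.append(Preview(url=url, title=title, summary=summary))
    return previews


# Illustrative feed content, invented for this sketch.
SAMPLE_FEED = """<rss version="2.0"><channel>
<title>Example feed</title>
<link>http://example.com/</link>
<item><title>Breaking story</title><link>http://example.com/a</link><description>First details of the story.</description></item>
<item><title>No link here</title><description>Skipped.</description></item>
</channel></rss>"""

if __name__ == "__main__":
    for preview in previews_from_rss(SAMPLE_FEED):
        print(preview.url, "|", preview.title, "|", preview.summary)
```

In a pipeline of this shape, the resulting records would go to the indexer in place of the full page content, and would presumably be replaced once the crawler fetches the page itself.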