Improving web search relevance and freshness with content previews

Authors:
Siva Gurumurthy;Hang Su;Vasileios Kandylas;Vidhyashankar Venkataraman
Affiliations:
Yahoo! Inc., Sunnyvale, CA, USA;Yahoo! Inc., Sunnyvale, CA, USA;Yahoo! Inc., Sunnyvale, CA, USA;Yahoo! Inc., Sunnyvale, CA, USA
Venue:
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Year:
2010

Abstract

Traditional web search engines struggle to achieve good search quality for recency-sensitive queries because they are prone to delays in discovering, indexing and ranking new web pages. In this paper we introduce PreGen, an adaptive preview generation system that runs as part of a web search engine to improve search result quality for recency-sensitive queries. PreGen uses a machine learning algorithm to classify and select live web feeds, and generates "previews" of new web pages based on the link descriptions available in these feeds. The search engine can then index and present relevant page previews as part of its search results before the pages are fetched from the web, thereby reducing end-to-end delays. Our experiments show that PreGen improves the search relevance of a state-of-the-art search engine for recency-sensitive queries by 3% and reduces the average latency of affected documents by 50%.
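
As an illustration of the preview idea only, the following minimal sketch builds previews from the link descriptions in a feed for pages that have not yet been fetched. It assumes an RSS 2.0 feed and a set of already-indexed URLs. The names Preview, previews_from_feed, known_urls and SAMPLE_FEED are hypothetical and do not come from the paper, and the machine-learned feed classification and selection step is not modeled here.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass
from typing import List, Set


@dataclass
class Preview:
    """Hypothetical preview record: what could be indexed before the page is fetched."""
    url: str
    title: str
    snippet: str


def previews_from_feed(rss_xml: str, known_urls: Set[str]) -> List[Preview]:
    """Build previews for feed items whose links are not yet indexed.

    Assumption: the feed is RSS 2.0, so each <item> may carry <link>,
    <title> and <description> children.
    """
    root = ET.fromstring(rss_xml)
    previews = []
    for item in root.iter("item"):
        url = (item.findtext("link") or "").strip()
        if not url or url in known_urls:
            continue  # skip items with no link or pages already indexed
        previews.append(
            Preview(
                url=url,
                title=(item.findtext("title") or "").strip(),
                snippet=(item.findtext("description") or "").strip(),
            )
        )
    return previews


# Made-up sample feed used only to exercise the sketch.
SAMPLE_FEED = """<rss version="2.0"><channel>
<title>Example news</title>
<item><title>Old story</title><link>http://example.com/old</link>
<description>Already crawled.</description></item>
<item><title>Breaking story</title><link>http://example.com/new</link>
<description>Short summary of the new page.</description></item>
</channel></rss>"""

if __name__ == "__main__":
    for preview in previews_from_feed(SAMPLE_FEED, {"http://example.com/old"}):
        print(preview)
```

In this sketch the link description stands in for the page content, so a preview can be made searchable as soon as the feed is read rather than after the page itself is crawled.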