Diversifying web search results

Authors:
Davood Rafiei;Krishna Bharat;Anand Shukla
Affiliations:
University of Alberta, Edmonton, AB, Canada;Google Inc., Mountain View, CA, USA;Google Inc., Mountain View, CA, USA
Venue:
Proceedings of the 19th international conference on World wide web
Year:
2010

Citing 15
Cited 35

Practical methods of optimization; (2nd ed.)
The use of MMR, diversity-based reranking for reordering documents and producing summaries

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Modern Information Retrieval
Beyond independent relevance: methods and evaluation metrics for subtopic retrieval

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval
Improving web search results using affinity graph

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A picture of search

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Less is more: probabilistic models for retrieving fewer relevant documents

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Improving personalized web search using result diversification

SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Predicting clicks: estimating the click-through rate for new ads

Proceedings of the 16th international conference on World Wide Web
Information re-retrieval: repeat queries in Yahoo's logs

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Novelty and diversity in information retrieval evaluation

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Diversifying image search with user generated content

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Diversifying search results

Proceedings of the Second ACM International Conference on Web Search and Data Mining
Efficient Computation of Diverse Query Results

ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
Portfolio theory of information retrieval

Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval

A comparative analysis of cascade measures for novelty and diversity

Proceedings of the fourth ACM international conference on Web search and data mining
Multi-dimensional search result diversification

Proceedings of the fourth ACM international conference on Web search and data mining
Trends in search interaction

Search computing
Efficient diversification of web search results

Proceedings of the VLDB Endowment
Active learning to maximize accuracy vs. effort in interactive information retrieval

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Evaluating diversified search results using per-intent graded relevance

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
On diversifying and personalizing web search

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Selecting a comprehensive set of reviews

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Aggregated search result diversification

ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
A multi-faceted approach to query intent classification

SPIRE'11 Proceedings of the 18th international conference on String processing and information retrieval
Diversifying Product Review Rankings: Getting the Full Picture

WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Coreference aware web object retrieval

Proceedings of the 20th ACM international conference on Information and knowledge management
Intent-based diversification of web search results: metrics and algorithms

Information Retrieval
Relational click prediction for sponsored search

Proceedings of the fifth ACM international conference on Web search and data mining
Evaluation with informational and navigational intents

Proceedings of the 21st international conference on World Wide Web
Max-Sum diversification, monotone submodular functions and dynamic updates

PODS '12 Proceedings of the 31st symposium on Principles of Database Systems
Top-k bounded diversification

SIGMOD '12 Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data
Search intent estimation from user's eye movements for supporting information seeking

Proceedings of the International Working Conference on Advanced Visual Interfaces
Diversity by proportionality: an election-based approach to search result diversification

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Personalized diversification of search results

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Combining implicit and explicit topic representations for result diversification

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Diversification for multi-domain result sets

ICWE'12 Proceedings of the 12th international conference on Web Engineering
Reranking web search results for diversity

Information Retrieval
On the role of novelty for search result diversification

Information Retrieval
Efficient jaccard-based diversity analysis of large document collections

Proceedings of the 21st ACM international conference on Information and knowledge management
Measuring the coverage and redundancy of information search services on e-commerce platforms

Electronic Commerce Research and Applications
mNIR: diversifying search results based on a mixture of novelty, intention and relevance

WISE'12 Proceedings of the 13th international conference on Web Information Systems Engineering
Preference based evaluation measures for novelty and diversity

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Top-k diversity queries over bounded regions

ACM Transactions on Database Systems (TODS)
Diversity maximization under matroid constraints

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Search result presentation: supporting post-search navigation by integration of taxonomy data

Proceedings of the 22nd international conference on World Wide Web companion
Groundhog day: near-duplicate detection on Twitter

Proceedings of the 22nd international conference on World Wide Web
To personalize or not: a risk management perspective

Proceedings of the 7th ACM conference on Recommender systems
Mining subtopics from different aspects for diversifying search results

Information Retrieval
Mining subtopics from text fragments for a web query

Information Retrieval

Abstract

Result diversity is a topic of great importance as more facets of queries are discovered and users expect to find their desired facets on the first page of results. However, the underlying questions of how 'diversity' interplays with 'quality', and when preference should be given to one or both, are not well understood. In this work, we model the problem as expectation maximization and study the challenges of estimating the model parameters and reaching an equilibrium. One model parameter, for example, is the correlation between pages, which we estimate using the textual contents of pages and click data (when available). We conduct experiments on diversifying randomly selected queries from a query log and queries chosen from the disambiguation topics of Wikipedia. On random queries, our algorithm improves upon Google in terms of diversity, retrieving 14% to 38% more aspects of queries in the top 5 results, while maintaining a precision very close to Google's. On a more selective set of queries that are expected to benefit from diversification, our algorithm improves upon Google in terms of both precision and diversity of the results, and significantly outperforms another baseline system for result diversification.
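
The trade-off between quality and diversity described in the abstract can be illustrated with a small sketch. This is not the paper's model: it is a generic MMR-style greedy heuristic, and it assumes word-overlap (Jaccard) similarity as a crude stand-in for the text-based correlation between pages. The function names, the `penalty` weight, the relevance scores, and the toy pages for the ambiguous query "jaguar" are all hypothetical.

```python
from typing import Dict, List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Word-overlap similarity; an assumed proxy for page correlation."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def diversify(relevance: Dict[str, float], text: Dict[str, str],
              k: int, penalty: float = 0.5) -> List[str]:
    """Greedily pick k pages, trading relevance against redundancy.

    A page's redundancy is its highest similarity to any page already
    selected; `penalty` (hypothetical) sets how much that costs.
    """
    tokens = {page: set(text[page].lower().split()) for page in relevance}
    selected: List[str] = []
    remaining = set(relevance)
    while remaining and len(selected) < k:
        def score(page: str) -> float:
            redundancy = max(
                (jaccard(tokens[page], tokens[s]) for s in selected),
                default=0.0,
            )
            return relevance[page] - penalty * redundancy
        # Iterating in sorted order makes ties resolve deterministically.
        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


def aspects_covered(ranking: List[str], aspect_of: Dict[str, str], k: int) -> int:
    """Count distinct query aspects among the top-k results."""
    return len({aspect_of[page] for page in ranking[:k]})


# Toy data (made up for illustration only).
relevance = {"a": 0.9, "b": 0.85, "c": 0.7, "d": 0.6}
text = {
    "a": "jaguar car dealer price",
    "b": "jaguar car dealer review",
    "c": "jaguar cat habitat rainforest",
    "d": "jaguar football team jacksonville",
}
aspect_of = {"a": "car", "b": "car", "c": "animal", "d": "team"}

by_relevance = sorted(relevance, key=relevance.get, reverse=True)
diversified = diversify(relevance, text, k=2)

print(by_relevance[:2], aspects_covered(by_relevance, aspect_of, 2))
print(diversified, aspects_covered(diversified, aspect_of, 2))
```

On this toy input, the relevance-only ranking fills the top 2 with two near-identical car pages, while the penalized ranking swaps the second for the animal page. Counting distinct aspects in the top k, as `aspects_covered` does, is one simple way to read the "more aspects of queries in the top 5" comparison; the paper's exact metric may differ.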