Efficient Computation of Diverse Query Results

  • Authors:
  • Erik Vee;Utkarsh Srivastava;Jayavel Shanmugasundaram;Prashant Bhat;Sihem Amer Yahia

  • Affiliations:
  • Yahoo! Research, Sunnyvale, CA, USA. erikvee@yahoo-inc.com;Yahoo! Research, Sunnyvale, CA, USA. utkarsh@yahoo-inc.com;Yahoo! Research, Sunnyvale, CA, USA. jaishan@yahoo-inc.com;Yahoo! Research, Sunnyvale, CA, USA. pbhat@yahoo-inc.com;Yahoo! Research, Sunnyvale, CA, USA. sihem@yahoo-inc.com

  • Venue:
  • ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We study the problem of efficiently computing diverse query results in online shopping applications, where users specify queries through a form interface that allows a mix of structured and content-based selection conditions. Intuitively, the goal of diverse query answering is to return a representative set of top-k answers from all the tuples that satisfy the user selection condition. For example, if a user is searching for Honda cars and we can only display five results, we wish to return cars from five different Honda models, as opposed to returning cars from only one or two Honda models. A key contribution of this paper is to formally define the notion of diversity, and to show that existing score based techniques commonly used in web applications are not sufficient to guarantee diversity. Another contribution of this paper is to develop novel and efficient query processing techniques that guarantee diversity. Our experimental results using Yahoo! Autos data show that our proposed techniques are scalable and efficient.