Towards a collection-based results diversification

  • Authors:
  • John A. Akinyemi;Charles L. A. Clarke;Maheedhar Kolla

  • Affiliations:
  • University of Waterloo, Waterloo, ON;University of Waterloo, Waterloo, ON;University of Waterloo, Waterloo, ON

  • Venue:
  • RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a method that introduces diversity into document retrieval using clusters of top-m terms obtained from the top-k retrieved documents through pseudo-relevance feedback. Terms from each cluster are used to automatically expand the original query. We evaluate the effectiveness of our method using a non-traditional effectiveness evaluation method, which directly measures the level of diversification by computing the cosine similarity between top-k retrieved documents based on (i) the original query and (ii) the expanded queries. Our results indicate that we can increase diversity without compromising retrieval quality.