Evaluating hierarchical clustering of search results

  • Authors:
  • Juan M. Cigarran;Anselmo Pen̈as;Julio Gonzalo;Felisa Verdejo

  • Affiliations:
  • Dept. Lenguajes y Sistemas Informáticos, E.T.S.I. Informática UNED;Dept. Lenguajes y Sistemas Informáticos, E.T.S.I. Informática UNED;Dept. Lenguajes y Sistemas Informáticos, E.T.S.I. Informática UNED;Dept. Lenguajes y Sistemas Informáticos, E.T.S.I. Informática UNED

  • Venue:
  • SPIRE'05 Proceedings of the 12th international conference on String Processing and Information Retrieval
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a goal-oriented evaluation measure, Hierarchy Quality, for hierarchical clustering algorithms applied to the task of organizing search results -such as the clusters generated by Vivisimo search engine-. Our metric considers the content of the clusters, their hierarchical arrangement, and the effort required to find relevant information by traversing the hierarchy starting from the top node. It compares the effort required to browse documents in a baseline ranked list with the minimum effort required to find the same amount of relevant information by browsing the hierarchy (which involves examining both documents and node descriptors).