Scalable hierarchical topic detection: exploring a sample based approach

  • Authors:
  • Dolf Trieschnigg;Wessel Kraaij

  • Affiliations:
  • University of Twente, Enschede, The Netherlands;TNO, Delft, The Netherlands

  • Venue:
  • Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hierarchical topic detection is a new task in the TDT 2004 evaluation program, which aims to organize an unstructured news collection in a directed acyclic graph (DAG) structure, reflecting the topics discussed. We present a scalable architecture for HTD and compare several alternative choices for agglomerative clustering and DAG optimization in order to minimize the HTD cost metric.