Flexible intrinsic evaluation of hierarchical clustering for TDT

Authors:
James Allan;Ao Feng;Alvaro Bolivar
Affiliations:
University of Massachusetts, MA;University of Massachusetts, MA;University of Massachusetts, MA
Venue:
CIKM '03 Proceedings of the twelfth international conference on Information and knowledge management
Year:
2003

Citing 9
Cited 11

The effectiveness and efficiency of agglomerative hierarchic clustering in document retrieval

The effectiveness and efficiency of agglomerative hierarchic clustering in document retrieval
Recent trends in hierarchic document clustering: a critical review

Information Processing and Management: an International Journal
Evaluating document clustering for interactive information retrieval

Proceedings of the tenth international conference on Information and knowledge management
Topic Detection and Tracking: Event-Based Information Organization

Topic Detection and Tracking: Event-Based Information Organization
Information Retrieval

Information Retrieval
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Topic detection and tracking evaluation overview

Topic detection and tracking
Corpora for topic detection and tracking

Topic detection and tracking
Explorations within topic tracking and detection

Topic detection and tracking

Event threading within news topics

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Profile-based event tracking

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Scalable hierarchical topic detection: exploring a sample based approach

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Navigation behavior models for link structure optimization

User Modeling and User-Adapted Interaction
Unsupervised Topic Detection in document collections: an application in marketing and business journals

International Journal of Business Intelligence and Data Mining
Timeline Analysis of Web News Events

ADMA '08 Proceedings of the 4th international conference on Advanced Data Mining and Applications
Recovery Rate of Clustering Algorithms

PSIVT '09 Proceedings of the 3rd Pacific Rim Symposium on Advances in Image and Video Technology
Improving the dynamic hierarchical compact clustering algorithm by using feature selection

CIARP'10 Proceedings of the 15th Iberoamerican congress conference on Progress in pattern recognition, image analysis, computer vision, and applications
Following the social media: aspect evolution of online discussion

SBP'11 Proceedings of the 4th international conference on Social computing, behavioral-cultural modeling and prediction
Event detection with spatial latent Dirichlet allocation

Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Clustering-Based searching and navigation in an online news source

ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Topic Detection and Tracking (TDT) evaluation program has included a "cluster detection" task since its inception in 1996. Systems were required to process a stream of broadcast news stories and partition them into non-overlapping clusters. A system's effectiveness was measured by comparing the generated clusters to "truth" clusters created by human annotators. Starting in 2003, TDT is moving to a more realistic model that permits overlapping clusters (stories may be on more than one topic) and encourages the creation of a hierarchy to structure the relationships between clusters (topics). We explore a range of possible evaluation models for this modified TDT clustering task to understand the best approach for mapping between the human-generated "truth" clusters and a much richer hierarchical structure. We demonstrate that some obvious evaluation techniques fail for degenerate cases. For a few others we attempt to develop an intuitive sense of what the evaluation numbers mean. We settle on some approaches that incorporate a strong balance between cluster errors (misses and false alarms) and the distance it takes to travel between stories within the hierarchy.