Distribution of relevant documents in domain-level aggregates for topic distillation

Authors:
V. Plachouras;I. Ounis
Affiliations:
University of Glasgow, Glasgow, United Kingdom;University of Glasgow, Glasgow, United Kingdom
Venue:
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters
Year:
2004

Citing 4
Cited 0

Identifying aggregates in hypertext structures

HYPERTEXT '91 Proceedings of the third annual ACM conference on Hypertext
Defining logical domains in a web site

HYPERTEXT '00 Proceedings of the eleventh ACM on Hypertext and hypermedia
Probabilistic models of information retrieval based on measuring the divergence from randomness

ACM Transactions on Information Systems (TOIS)
Untangling compound documents on the web

Proceedings of the fourteenth ACM conference on Hypertext and hypermedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we study the distribution of relevant documents in aggregates, formed by grouping the retrieved documents according to their domain. For each aggregate, we take into account its size, and a measure of the correlation between its incoming and outgoing hyperlinks. We report on a preliminary experiment with two TREC topic distillation tasks, where we find that larger aggregates, or those aggregates with correlated hyperlinks, are more likely to contain relevant documents. This result shows that the distribution of domain-level aggregates is potentially useful for finding relevant documents.