Document Clustering and Language Models for System-Mediated Information Access

  • Authors:
  • Gheorghe Muresan;David J. Harper

  • Affiliations:
  • -;-

  • Venue:
  • ECDL '01 Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents the novel concept of system-mediated information access, i.e. system support for the user in clarifying and refining a vague information need and in generating a good formulation for it. The concept is based on two main assumptions: firstly, on document clustering's ability to reveal the topical, semantic structure of a domain of interest, represented by a specialized collection, and secondly, on the capacity of language models to convey content. Experimental results show that these assumptions are correct and that there is potential to significantly improve the retrieval performance by generating a better query through mediation.