A language modelling approach to relevance profiling for document browsing

  • Authors:
  • David J. Harper; Sara Coulthard; Sun Yixing

  • Affiliations:
  • The Robert Gordon University School of Computing, Aberdeen, UK; The Robert Gordon University School of Computing, Aberdeen, UK; The Robert Gordon University School of Computing, Aberdeen, UK

  • Venue:
  • Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
  • Year:
  • 2002

Abstract

This paper describes a novel tool, SmartSkim, for content-based browsing or skimming of documents. The tool integrates concepts from passage retrieval and from interfaces, such as TileBars, which provide a compact overview of query term hits within a document. We base our tool on the concept of relevance profiling, in which a plot of retrieval status values at each word position of a document is generated. A major contribution of this paper is applying language modelling to the task of relevance profiling. We describe in detail the design of the SmartSkim tool, and provide a critique of the design. Possible applications of the tool are described, and we consider how an operational version of SmartSkim might be designed.
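
To make the idea of a relevance profile concrete, the following is a minimal sketch of one way to score every word position of a document against a query with a unigram language model. It is not taken from the paper: the function name relevance_profile, the fixed-size window centred on each position, the smoothing weight lam, and the choice of Jelinek-Mercer smoothing against a whole-document model are all assumptions made for illustration.

```python
import math
from collections import Counter


def relevance_profile(doc_tokens, query_tokens, window=5, lam=0.5):
    """Return one query log-likelihood score per word position.

    Assumptions (hypothetical, not from the paper): a fixed-size window
    centred on each position, a unigram window model, and Jelinek-Mercer
    smoothing against the whole-document model with weight `lam`.
    """
    n = len(doc_tokens)
    doc_counts = Counter(doc_tokens)
    vocab_size = len(doc_counts) + 1  # +1 reserves mass for one unseen term
    half = window // 2
    profile = []
    for i in range(n):
        # Window of tokens around position i, clipped at the document edges.
        win = doc_tokens[max(0, i - half): i + half + 1]
        win_counts = Counter(win)
        score = 0.0
        for term in query_tokens:
            p_win = win_counts[term] / len(win)
            # Add-one smoothed background, so the log is always defined.
            p_doc = (doc_counts[term] + 1) / (n + vocab_size)
            score += math.log(lam * p_win + (1 - lam) * p_doc)
        profile.append(score)
    return profile


doc = "a b c language model d e f g h".split()
profile = relevance_profile(doc, ["language", "model"])
```

Plotting profile against word position would give a curve that rises where the query terms cluster, which is the kind of overview the abstract describes. The window size and smoothing weight would need tuning on real documents.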