ProbMap -- A probabilistic approach for mapping large document collections

  • Authors:
  • Thomas Hofmann

  • Affiliations:
  • Department of Computer Science, Brown University, Box 1910, Providence, RI 02912, USA. E-mail: th@cs.brown.edu

  • Venue:
  • Intelligent Data Analysis
  • Year:
  • 2000

Quantified Score

Hi-index 0.00

Visualization

Abstract

The visualization of large text databases and document collections is an important step towards more flexible and interactive types of information access and retrieval. This paper presents a probabilistic approach which combines a statistical, model-based analysis of a given set of documents with a topological visualization principle. Our method can be utilized to derive topic maps, which represent topical information by characteristic keyword distributions arranged in a two-dimensional spatial layout. Combined with multi-resolution techniques this provides a three-dimensional space for interactive information navigation in large text collections.