Termite: visualization techniques for assessing textual topic models

  • Authors:
  • Jason Chuang;Christopher D. Manning;Jeffrey Heer

  • Affiliations:
  • Stanford University;Stanford University;Stanford University

  • Venue:
  • Proceedings of the International Working Conference on Advanced Visual Interfaces
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Topic models aid analysis of text corpora by identifying latent topics based on co-occurring words. Real-world deployments of topic models, however, often require intensive expert verification and model refinement. In this paper we present Termite, a visual analysis tool for assessing topic model quality. Termite uses a tabular layout to promote comparison of terms both within and across latent topics. We contribute a novel saliency measure for selecting relevant terms and a seriation algorithm that both reveals clustering structure and promotes the legibility of related terms. In a series of examples, we demonstrate how Termite allows analysts to identify coherent and significant themes.