DocuBurst: visualizing document content using language structure

  • Authors: Christopher Collins; Sheelagh Carpendale; Gerald Penn

  • Affiliations: University of Toronto, Toronto, Canada; University of Calgary, Calgary, Canada; University of Toronto, Toronto, Canada

  • Venue: EuroVis'09: Proceedings of the 11th Eurographics / IEEE-VGTC Conference on Visualization
  • Year: 2009

Visualization

Abstract

Textual data is at the forefront of information management problems today. One response has been the development of visualizations of text data. These visualizations, commonly based on simple attributes such as relative word frequency, have become increasingly popular tools. We extend this direction, presenting the first visualization of document content that combines word frequency with the human-created structure in lexical databases, so that the result also reflects semantic content. DocuBurst is a radial, space-filling layout of hyponymy (the IS-A relation), overlaid with occurrence counts of words in a document of interest to provide visual summaries at varying levels of granularity. Interactive document analysis is supported with geometric and semantic zoom, selectable focus on individual words, and linked access to source text.
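
The core idea in the abstract, word counts overlaid on an IS-A hierarchy and drawn as a radial, space-filling layout, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the toy `HYPONYMS` hierarchy stands in for a real lexical database, the function names are hypothetical, and sizing each wedge by the number of leaves beneath it is only one possible choice. The cumulative counts are the values a renderer might map to colour or opacity.

```python
from collections import Counter

# Hypothetical toy IS-A hierarchy (parent -> hyponyms). A real system would
# read this structure from a lexical database rather than hard-coding it.
HYPONYMS = {
    "animal": ["dog", "cat", "bird"],
    "dog": ["poodle", "terrier"],
    "cat": [],
    "bird": ["sparrow"],
    "poodle": [],
    "terrier": [],
    "sparrow": [],
}


def cumulative_counts(root, counts):
    """Return {node: its own count plus the counts of all hyponyms below it}."""
    totals = {}

    def visit(node):
        total = counts.get(node, 0) + sum(visit(c) for c in HYPONYMS[node])
        totals[node] = total
        return total

    visit(root)
    return totals


def leaf_count(node):
    """Number of leaves in the subtree rooted at node (a leaf counts as 1)."""
    children = HYPONYMS[node]
    return 1 if not children else sum(leaf_count(c) for c in children)


def radial_layout(root):
    """Assign each node a (depth, start_angle, end_angle) wedge in degrees.

    Assumption: a child's angular width is proportional to its leaf count.
    """
    layout = {}

    def place(node, depth, start, end):
        layout[node] = (depth, start, end)
        children = HYPONYMS[node]
        if not children:
            return
        span_per_leaf = (end - start) / leaf_count(node)
        cursor = start
        for child in children:
            width = span_per_leaf * leaf_count(child)
            place(child, depth + 1, cursor, cursor + width)
            cursor += width

    place(root, 0, 0.0, 360.0)
    return layout


text = "the dog chased the cat while a poodle and a sparrow watched the dog"
counts = Counter(text.split())
totals = cumulative_counts("animal", counts)
layout = radial_layout("animal")

for node, (depth, start, end) in layout.items():
    print(f"{node:8s} depth={depth} wedge=[{start:.0f}, {end:.0f}) count={totals[node]}")
```

In this sketch a general term such as "animal" accumulates the counts of all the more specific words beneath it, which is what allows a summary to be read at a coarse or a fine level of granularity.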