Corpus clouds - facilitating text analysis by means of visualizations

  • Authors:
  • Chris Culy;Verena Lyding

  • Affiliations:
  • European Academy Bozen/Bolzano, Viale Druso, Bolzano, Italy;European Academy Bozen/Bolzano, Viale Druso, Bolzano, Italy

  • Venue:
  • LTC'09 Proceedings of the 4th conference on Human language technology: challenges for computer science and linguistics
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Large text corpora are a main language resource for the humandriven analysis of linguistic phenomena. With the ever increasing amount of data, it is vital to find ways to help people understand the data, and visualization techniques provide one way to do that. Corpus Clouds is a program which provides visualizations of different types of frequency information dynamically derived from a corpus via a standard query system, integrated with a standard KWIC display. We apply established principles from information visualization to provide dynamic, interactive representations of the query results. The selected design principles and alternatives to the implementation will be discussed and a preview on what other types of information connected to corpora can be visualized in similar ways are provided. Corpus Clouds can thus be seen as answer. to the call by Collins et al. [1] to design in a principled way new visualization tools for linguistic data.