The corpus analysis toolkit - analysing multilevel annotations

  • Authors: Stephen Wilson; Julie Carson-Berndsen
  • Affiliations: School of Computer Science and Informatics, University College Dublin (both authors)
  • Venue: LTC'09 Proceedings of the 4th conference on Human language technology: challenges for computer science and linguistics
  • Year: 2009

Abstract

This paper considers a number of issues in current annotation science and corpus analysis and presents a bespoke suite of software, the Corpus Analysis Toolkit, for processing and analysing multilevel annotations of time-aligned linguistic data. The toolkit provides a variety of specialised tools for temporal analysis of annotated linguistic data. It is independent of any particular feature set or corpus and supports a number of commonly used annotation formats.
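
As a rough illustration of what temporal analysis over multilevel, time-aligned annotations can involve, the sketch below represents two annotation tiers as labelled intervals and lists the pairs that overlap in time. It is not the Corpus Analysis Toolkit's API: the `Interval` type, the `overlaps` function, and the sample tiers are all hypothetical and assumed purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """One annotation: a label spanning [start, end) in seconds (hypothetical type)."""
    start: float
    end: float
    label: str


def overlaps(tier_a, tier_b):
    """Return (a, b, duration) for every pair of intervals that overlap in time."""
    result = []
    for a in tier_a:
        for b in tier_b:
            # Shared time is the gap between the later start and the earlier end.
            duration = min(a.end, b.end) - max(a.start, b.start)
            if duration > 0:
                result.append((a, b, duration))
    return result


# Made-up sample data: a phone tier and a feature tier for the same signal.
phones = [Interval(0.0, 0.5, "p"), Interval(0.5, 1.0, "a")]
features = [Interval(0.25, 0.75, "voiced")]

pairs = overlaps(phones, features)
for a, b, duration in pairs:
    print(f"{a.label} overlaps {b.label} for {duration} s")
```

The pairwise comparison is quadratic in the number of intervals, which keeps the sketch short; a sweep over intervals sorted by start time would be a more natural choice for large corpora.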