Information structure in African languages: corpora and tools

  • Authors:
  • Christian Chiarcos; Ines Fiedler; Mira Grubic; Katharina Hartmann; Julia Ritz; Anne Schwarz; Amir Zeldes; Malte Zimmermann

  • Affiliations:
  • Universität Potsdam, Potsdam, Germany 14476; Humboldt-Universität zu Berlin, Berlin, Germany 10099; Universität Potsdam, Potsdam, Germany 14476; Humboldt-Universität zu Berlin, Berlin, Germany 10099; Universität Potsdam, Potsdam, Germany 14476; The Cairns Institute / James Cook University, Cairns, Australia 4870; Humboldt-Universität zu Berlin, Berlin, Germany 10099; Universität Potsdam, Potsdam, Germany 14476

  • Venue:
  • Language Resources and Evaluation
  • Year:
  • 2011

Abstract

In this paper, we describe tools and resources for the study of African languages developed at the Collaborative Research Centre 632 "Information Structure". These include deeply annotated data collections for 25 sub-Saharan languages, described together with their annotation scheme, and the corpus tool ANNIS, which provides unified access to a broad variety of annotations created with a range of different tools. By applying ANNIS to several African data collections, we illustrate its suitability for language documentation, distributed access, and the creation of data archives.