Disambiguation of morphological analysis in Bantu languages

  • Authors:
  • Arvi Hurskainen

  • Affiliations:
  • University of Helsinki, Finland

  • Venue:
  • COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper describes problems in disambiguating the morphological analysis of Bantu languages by using Swahili as a test language. The main factors of ambiguity in this language group can be traced to the noun class structure on one hand and to the bi-directional word-formation on the other. In analyzing word-forms, the system applied utilizes SWATWOL, a morphological parsing program based on two-level formalism. Disambiguation is carried out with the latest version (April 1996) of the Constraint Grammar Parser (CGP). Statistics on ambiguity are provided. Solutions for resolving different types of ambiguity are presented and they are demonstrated by examples from corpus text. Finally, statistics on the performance of the disambiguator are presented.