This paper describes problems in disambiguating the morphological analysis of Bantu languages, using Swahili as a test language. The main sources of ambiguity in this language group can be traced to the noun class structure on the one hand and to bi-directional word-formation on the other. Word-forms are analyzed with SWATWOL, a morphological parsing program based on the two-level formalism. Disambiguation is carried out with the latest version (April 1996) of the Constraint Grammar Parser (CGP). Statistics on ambiguity are provided. Solutions for resolving different types of ambiguity are presented and illustrated with examples from corpus text. Finally, statistics on the performance of the disambiguator are presented.