The Acquisition of Some Lexical Constraints from Corpora
TSD '99 Proceedings of the Second International Workshop on Text, Speech and Dialogue
INTEX: a corpus processing system
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Evaluation of an algorithm for the recognition and classification of proper names
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Hi-index | 0.00 |
In this paper we will present an approach to acquisition of some classes of compound words from large corpora, as well as a method for semi-automatic generation of appropriate linguistic models, that can be further used for compound word recognition and for completion of compound word dictionaries. The approach is intended for a highly inflective language such as Serbo-Croatian. Generated linguistic models are represented by local grammars.