Foundations of statistical natural language processing
Foundations of statistical natural language processing
Introduction to the special issue on the web as corpus
Computational Linguistics - Special issue on web as corpus
Hi-index | 0.00 |
This paper describes Linguistics Tools, an extensible corpus query tool designed for the search for prepositional multi-word expressions in corpora of the Portuguese language, using classic algorithms such as T-Test, Log Likelihood and Mutual Information, but also leaving room for the implementation of further parsing and identification functions and algorithms. This tool was developed in the Java language and takes as input corpora annotated by the parser PALAVRAS (Bick2000). A description of the tool is given, and results from two corpora of different characteristics but of the same size are presented and compared.