Linguistics Tools: uma plataforma expansível de funções de consulta a corpus

  • Authors:
  • Nuno Caminada;Violeta Quental;Milena Garrão

  • Affiliations:
  • Instituto Militar de Engenharia, Rio de Janeiro, Brasil;Pontifícia Universidade Católica, Rio de Janeiro, Brasil;Pontifícia Universidade Católica, Rio de Janeiro, Brasil

  • Venue:
  • Companion Proceedings of the XIV Brazilian Symposium on Multimedia and the Web
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes Linguistics Tools, an extensible corpus query tool designed for the search for prepositional multi-word expressions in corpora of the Portuguese language, using classic algorithms such as T-Test, Log Likelihood and Mutual Information, but also leaving room for the implementation of further parsing and identification functions and algorithms. This tool was developed in the Java language and takes as input corpora annotated by the parser PALAVRAS (Bick2000). A description of the tool is given, and results from two corpora of different characteristics but of the same size are presented and compared.