We present a newly available online resource for Portuguese: a 310-million-word corpus that is a new version of the Reference Corpus of Contemporary Portuguese, now searchable through a user-friendly web interface. We report on the work carried out on the corpus prior to its online publication, focusing on the processes and tools used to clean, prepare, and annotate it so that it is suitable for linguistic inquiry.