Linguistic issues in the development of ReGra: A grammar checker for Brazilian Portuguese

  • Authors:
  • Ronaldo Teixeira Martins;Ricardo Hasegawa;Maria Das Graças VolpeNunes;Gisele Montilha;Osvaldo Novais De Oliveira, Jr.

  • Affiliations:
  • Instituto de Estudos da Linguagem, Unicamp, Campinas, SP, Brazil, e-mail: rotma@nilc.icmsc.sc.usp.br;Instituto de Ciências Matemáticas de São Carlos – USP – CP 668 – 13560-970 São Carlos, SP, Brazil, e-mail: rh@icmsc.sc.usp.br mdgvnune@icmsc.sc.usp.br;Instituto de Ciências Matemáticas de São Carlos – USP – CP 668 – 13560-970 São Carlos, SP, Brazil, e-mail: rh@icmsc.sc.usp.br mdgvnune@icmsc.sc.usp.br;Departamento de Letras, Unesp, Araraquara, SP, Brazil, e-mail: gisele@nilc.icmsc.sc.usp.br;Instituto de Física de São Carlos, USP – CP 369 – 13560-970 São Carlos, SP, Brazil, e-mail: chu@ifqsc.sc.usp.br

  • Venue:
  • Natural Language Engineering
  • Year:
  • 1998

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a number of linguistic and computational issues identified during the implementation of a general use grammar checker for contemporary Brazilian Portuguese, ReGra, that has been incorporated in the word processor REDATOR by Itautec/Philco (Brazil). Two main strategies were employed in the implementation of correction rules: an error-driven, localist approach based on the identification of patterns indicative of grammatical mistakes; and a more generic approach that requires automatic syntactic analysis. In this discussion, particular emphasis is given to the development of a parser based on a phrase structure grammar comprising over 600 production rules. As for the computational performance, ReGra permits texts to be revised at a rate of ca. 200 words per second.