Automatic hyponymy identification from brazilian portuguese texts

  • Authors:
  • Leonardo Sameshima Taba;Helena de Medeiros Caseli

  • Affiliations:
  • Department of Computer Science, LaLiC/NILC, Federal University of São Carlos (UFSCar), São Carlos, SP, Brazil;Department of Computer Science, LaLiC/NILC, Federal University of São Carlos (UFSCar), São Carlos, SP, Brazil

  • Venue:
  • PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the natural language processing (NLP) scenario, Brazilian Portuguese (and Portuguese in general) still suffers from the lack of good quality base tools (e.g. parsers) and resources (e.g. annotated corpora). Corpora annotated with semantic information is particularly scarce and is a very costly resource to be produced manually. In order to provide some help to mend that situation, this paper presents an automatic hyponymy identification method for Brazilian Portuguese texts. The proposed method uses lexical and syntactic data alongside common sense information obtained from the Brazilian Open Mind Common Sense project (OMCS-Br). The results obtained so far are compatible with previous work and encourage other directions for further research.