Cognate mapping: a heuristic strategy for the semi-supervised acquisition of a Spanish lexicon from a Portuguese seed lexicon

  • Authors:
  • Stefan Schulz;Kornél Markó;Eduardo Sbrissia;Percy Nohama;Udo Hahn

  • Affiliations:
  • Paraná Catholic University, Curitiba, Brazil and Freiburg University Hospital, Germany;Freiburg University Hospital, Germany and Jena University, Germany;Paraná Catholic University, Curitiba, Brazil;Paraná Catholic University, Curitiba, Brazil;Jena University, Germany

  • Venue:
  • COLING '04 Proceedings of the 20th international conference on Computational Linguistics
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We deal with the automated acquisition of a Spanish medical subword lexicon from an already existing Portuguese seed lexicon. Using two non-parallel monolingual corpora we determined Spanish lexeme candidates from Portuguese seed lexicon entries by heuristic cognate mapping. We validated the emergent lexical translation hypotheses by determining the similarity of fixed-window context vectors on the basis of Portuguese and Spanish text corpora.