Automatic acquisition of wordnet relations by distributionally supported morphological patterns extracted from Polish corpora

  • Authors:
  • Roman Kurc;Maciej Piasecki;Stan Szpakowicz

  • Affiliations:
  • Institute of Informatics, Wrocław University of Technology, Poland;Institute of Informatics, Wrocław University of Technology, Poland;SITE, University of Ottawa, Canada and Institute of Computer Science, Polish Academy of Sciences, Poland

  • Venue:
  • TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Espresso is a pattern-based algorithm of extracting lexical-semantic relations, defined for English. We present its adaptation to Polish. We consider not only the technicalities such as the availability of language-processing tools for Polish, but also pattern structures which leverage the specificity of a strongly inflected language. We propose a new method of computing the reliability measure of extraction; this leads to a modified algorithm which we have named Estratto. In this paper we investigate the influence of additional lexico-semantic data and information from generic patterns.