Automatic acquisition of wordnet relations by distributionally supported morphological patterns extracted from Polish corpora

Authors:
Roman Kurc;Maciej Piasecki;Stan Szpakowicz
Affiliations:
Institute of Informatics, Wrocław University of Technology, Poland;Institute of Informatics, Wrocław University of Technology, Poland;SITE, University of Ottawa, Canada and Institute of Computer Science, Polish Academy of Sciences, Poland
Venue:
TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
Year:
2010

Citing 5
Cited 0

Automatic acquisition of hyponyms from large text corpora

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Espresso: leveraging generic patterns for automatically harvesting semantic relations

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Automatically creating datasets for measures of semantic relatedness

LD '06 Proceedings of the Workshop on Linguistic Distances
Reducing semantic drift with bagging and distributional similarity

ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Automatic selection of heterogeneous syntactic features in semantic similarity of polish nouns

TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue

Quantified Score

Hi-index	0.00

Visualization

Abstract

Espresso is a pattern-based algorithm of extracting lexical-semantic relations, defined for English. We present its adaptation to Polish. We consider not only the technicalities such as the availability of language-processing tools for Polish, but also pattern structures which leverage the specificity of a strongly inflected language. We propose a new method of computing the reliability measure of extraction; this leads to a modified algorithm which we have named Estratto. In this paper we investigate the influence of additional lexico-semantic data and information from generic patterns.