Automatic selection of heterogeneous syntactic features in semantic similarity of polish nouns

Authors:
Maciej Piasecki;Stanisław Szpakowicz;Bartosz Broda
Affiliations:
Institute of Applied Informatics, Wrocław University of Technology, Poland;School of Information Technology and Engineering, University of Ottawa and Institute of Computer Science, Polish Academy of Sciences;Institute of Applied Informatics, Wrocław University of Technology, Poland
Venue:
TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Year:
2007

Citing 11
Cited 6

Evaluation techniques for automatic semantic extraction: comparing syntactic and window based approaches

Corpus processing for lexical acquisition
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Similarity-based methods for word sense disambiguation

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Using syntactic dependency as local context to resolve word sense ambiguity

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Automatic retrieval and clustering of similar words

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Co-occurrence Retrieval: A Flexible Framework for Lexical Distributional Similarity

Computational Linguistics
Automatic Discovery of Part-Whole Relations

Computational Linguistics
Espresso: leveraging generic patterns for automatically harvesting semantic relations

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
New experiments in distributional representations of synonymy

CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Semantic similarity measure of polish nouns based on linguistic features

BIS'07 Proceedings of the 10th international conference on Business information systems
Effective architecture of the polish tagger

TSD'06 Proceedings of the 9th international conference on Text, Speech and Dialogue

Rank-Based Transformation in Measuring Semantic Relatedness

Canadian AI '09 Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence
The WordNet Weaver: Multi-criteria Voting for Semi-automatic Extension of a Wordnet

Canadian AI '09 Proceedings of the 22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence
Correction of medical handwriting OCR based on semantic similarity

IDEAL'07 Proceedings of the 8th international conference on Intelligent data engineering and automated learning
Automatic acquisition of wordnet relations by distributionally supported morphological patterns extracted from Polish corpora

TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
Heterogeneous knowledge sources in graph-based expansion of the polish wordnet

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
A supervised method of feature weighting for measuring semantic relatedness

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present experiments with a variety of corpus-based measures applied to the problem of constructing semantic similarity functions for Polish nouns. Rich inflection in Polish allows us to acquire useful syntactic features without parsing; morphosyntactic restrictions checked in a large enough window provide sufficiently useful data. A novel feature selection method gives the accuracy of 86% on the WordNet-based synonymy test, an improvement of 5% over the previous results.