Towards automatic acquisition of a fully sense tagged corpus for persian

Authors:
Bahareh Sarrafzadeh;Nikolay Yakovets;Nick Cercone;Aijun An
Affiliations:
Department of Computer Science and Engineering, York University, Canada;Department of Computer Science and Engineering, York University, Canada;Department of Computer Science and Engineering, York University, Canada;Department of Computer Science and Engineering, York University, Canada
Venue:
ISMIS'11 Proceedings of the 19th international conference on Foundations of intelligent systems
Year:
2011

Citing 11
Cited 0

An automatic method for generating sense tagged corpora

AAAI '99/IAAI '99 Proceedings of the sixteenth national conference on Artificial intelligence and the eleventh Innovative applications of artificial intelligence conference innovative applications of artificial intelligence
Distinguishing systems and distinguishing senses: new evaluation methods for Word Sense Disambiguation

Natural Language Engineering
One sense per discourse

HLT '91 Proceedings of the workshop on Speech and Natural Language
A semantic concordance

HLT '93 Proceedings of the workshop on Human Language Technology
Building a sense tagged corpus with open mind word expert

WSD '02 Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions - Volume 8
Word Sense Disambiguation of Farsi Homographs Using Thesaurus and Corpus

GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
WordNet::SenseRelate::AllWords: a broad coverage word sense tagger that maximizes semantic relatedness

NAACL-Demonstrations '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Demonstration Session
Extended gloss overlaps as a measure of semantic relatedness

IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
Extracting sense-disambiguated example sentences from parallel corpora

WDE '09 Proceedings of the 1st Workshop on Definition Extraction
Cross-lingual word sense disambiguation for languages with scarce resources

Canadian AI'11 Proceedings of the 24th Canadian conference on Advances in artificial intelligence
Crossing parallel corpora and multilingual lexical databases for WSD

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Sense tagged corpora play a crucial role in Natural Language Processing, particularly in Word Sense Disambiguation and Natural Language Understanding. Since semantic annotations are usually performed by humans, such corpora are limited to a handful of tagged texts and are not available for many languages with scarce resources including Persian. The shortage of efficient, reliable linguistic resources and fundamental text processing modules for Persian have been a challenge for researchers investigating this language. We employ a newlyproposed cross-lingual sense disambiguation algorithm to automatically create large sense tagged corpora. The initial evaluation of the tagged corpus indicates promising results.