Towards automatic acquisition of a fully sense tagged corpus for persian

  • Authors:
  • Bahareh Sarrafzadeh;Nikolay Yakovets;Nick Cercone;Aijun An

  • Affiliations:
  • Department of Computer Science and Engineering, York University, Canada;Department of Computer Science and Engineering, York University, Canada;Department of Computer Science and Engineering, York University, Canada;Department of Computer Science and Engineering, York University, Canada

  • Venue:
  • ISMIS'11 Proceedings of the 19th international conference on Foundations of intelligent systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Sense tagged corpora play a crucial role in Natural Language Processing, particularly in Word Sense Disambiguation and Natural Language Understanding. Since semantic annotations are usually performed by humans, such corpora are limited to a handful of tagged texts and are not available for many languages with scarce resources including Persian. The shortage of efficient, reliable linguistic resources and fundamental text processing modules for Persian have been a challenge for researchers investigating this language. We employ a newlyproposed cross-lingual sense disambiguation algorithm to automatically create large sense tagged corpora. The initial evaluation of the tagged corpus indicates promising results.