Statistical thesaurus construction for a morphologically rich language

  • Authors:
  • Chaya Liebeskind;Ido Dagan;Jonathan Schler

  • Affiliations:
  • Bar-Ilan University Ramat-Gan, Israel;Bar-Ilan University Ramat-Gan, Israel;Bar-Ilan University Ramat-Gan, Israel

  • Venue:
  • SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Corpus-based thesaurus construction for Morphologically Rich Languages (MRL) is a complex task, due to the morphological variability of MRL. In this paper we explore alternative term representations, complemented by clustering of morphological variants. We introduce a generic algorithmic scheme for thesaurus construction in MRL, and demonstrate the empirical benefit of our methodology for a Hebrew thesaurus.