Evaluation of perstem: a simple and efficient stemming algorithm for Persian

  • Authors:
  • Amir Hossein Jadidinejad;Fariborz Mahmoudi;Jon Dehdari

  • Affiliations:
  • Electrical and Computer Engineering Department, Islamic Azad University;Electrical and Computer Engineering Department, Islamic Azad University;Department of Linguistics, The Ohio State University

  • Venue:
  • CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Persian is a challenging language in the field of NLP. Rightto-left orthography, complex morphology, complicated grammatical rules, and different forms of letters make it an interesting language for NLP research. In this paper we measure the effectiveness of a simple and efficient stemming algorithm, Perstem, on Persian information retrieval. Our experiments on the Hamshahri corpus at CLEF2009 show that the Perstem algorithm greatly improved both precision (+91%) and recall (+43%).