A rule-based Arabic stemming algorithm

  • Authors:
  • Tengku Mohd T. Sembok; Belal Mustafa Abu Ata; Zainab Abu Bakar

  • Affiliations:
  • Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, Bangi, Malaysia; Department of Computer Science, Bahrain University, Bahrain; Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, Shah Alam, Malaysia

  • Venue:
  • ECC'11 Proceedings of the 5th European conference on European computing conference
  • Year:
  • 2011

Abstract

Stemming is used in information retrieval systems to reduce variant word forms to common roots in order to improve retrieval effectiveness. As with other languages, the indexing and retrieval of Arabic documents requires an effective stemming algorithm. The Arabic stemming algorithm developed by Al-Omari is studied, and new versions are proposed to enhance its performance. The improvements relate to the order in which the dictionary is looked up and the order in which the morphological rules are applied.
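
As a rough illustration of why lookup order and rule order can matter in a rule-based stemmer, the sketch below checks a root dictionary before applying any affix rule, then tries the rules in list order. The root set, the prefix and suffix lists, and the function name stem are toy assumptions made up for this example; they are not the rules of Al-Omari's algorithm or of the versions proposed in the paper.

```python
# Illustrative only: the root dictionary and affix rules below are toy
# assumptions, not the rules of Al-Omari's stemmer or of the paper.
ROOTS = {"كتب", "درس"}           # hypothetical root dictionary
PREFIXES = ["ال", "و"]            # hypothetical prefix rules, tried in list order
SUFFIXES = ["ون", "ات", "ة"]      # hypothetical suffix rules, tried in list order


def stem(word, roots=ROOTS, prefixes=PREFIXES, suffixes=SUFFIXES):
    """Return a dictionary root for word, or word unchanged if none is found."""
    # Dictionary lookup first: a word that is already a root needs no rules.
    if word in roots:
        return word

    # Build candidates in rule order: the word itself, then each
    # prefix-stripped form, then suffix-stripped forms of all of those.
    candidates = [word]
    for prefix in prefixes:
        if word.startswith(prefix):
            candidates.append(word[len(prefix):])
    for cand in list(candidates):
        for suffix in suffixes:
            if cand.endswith(suffix):
                candidates.append(cand[:-len(suffix)])

    # The first candidate found in the dictionary wins, so reordering the
    # rule lists can change which root is returned for an ambiguous word.
    for cand in candidates:
        if cand in roots:
            return cand
    return word
```

Under these toy rules, stem("الكتب") strips the prefix "ال" and returns the root "كتب", while a word with no matching root is returned unchanged.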