Rule Parser for Arabic Stemmer

  • Authors:
  • Imad A. Al-Sughaiyer;Ibrahim A. Al-Kharashi

  • Affiliations:
  • -;-

  • Venue:
  • TSD '02 Proceedings of the 5th International Conference on Text, Speech and Dialogue
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

Arabic language exhibits a complex but very regular morphological structure that greatly affects its automation. Current available morphological analysis techniques for the Arabic language are based on heavy computational processes and/or the need for large amount of associated data. Utilizing existed morphological techniques greatly degrade the efficiency of some natural language applications such as information retrieval system. This paper proposed a new Arabic morphological analysis technique. The technique is based on the pattern similarity of words derived from different roots. Unique patterns are extended and coded as rules that encode morphological characteristics. The technique does not require either complex computation or associated data yet adjustable to maintain enough accuracy. This technique utilizes a very simple parser to scan coded rules and decompose a given Arabic word into its morphological components.