Comparing words, stems, and roots as index terms in an Arabic Information Retrieval System
Journal of the American Society for Information Science
Towards an error-free Arabic stemming
Proceedings of the 2nd ACM workshop on Improving non english web searching
Automatic tagging of Arabic text: from raw text to base phrase chunks
HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Hi-index | 0.01 |
This paper proposes the building of a stemmer for the Arabic language. This stemmer is largely based on pattern matching and pattern strength techniques. Stemmers are algorithms to extract root from a word by removing its affixes. Stemming has been applied for large number of applications, such as: indexing, information retrieval systems, and web search engines. This paper will also proposes the application of stemming as a pre-processing stage in a dialogue system (DS). The proposed stemmer was compared with three other well known stemmers and achieved favourable accuracy.