We present an algorithm for clustering Arabic words that share the same root. Root-based clusters can substitute for dictionaries when indexing for IR. Our two-stage algorithm modifies that of Adamson and Boreham (1974): it first applies light stemming, then calculates similarity coefficients for word pairs using techniques sensitive to Arabic morphology. Tests on unedited Arabic text samples show that infixes are handled successfully and that clustering accuracy reaches up to 94.06%, without the use of dictionaries.
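The two stages can be illustrated with a minimal Python sketch. It is not the authors' implementation: the affix lists, the `min_len` guard, and the function names are assumptions made for illustration, and the similarity measure is the plain Dice coefficient over unique character bigrams in the style of Adamson and Boreham, without the Arabic-specific adjustments the abstract mentions.

```python
# Hypothetical affix lists for illustration only; not taken from the paper.
PREFIXES = ("وال", "بال", "كال", "فال", "ال", "و")
SUFFIXES = ("ها", "ان", "ات", "ون", "ين", "ية", "ه", "ة", "ي")


def light_stem(word, min_len=3):
    """Stage 1: strip at most one prefix and one suffix,
    keeping at least min_len characters (an assumed guard)."""
    for p in PREFIXES:
        if word.startswith(p) and len(word) - len(p) >= min_len:
            word = word[len(p):]
            break
    for s in SUFFIXES:
        if word.endswith(s) and len(word) - len(s) >= min_len:
            word = word[:-len(s)]
            break
    return word


def bigrams(word):
    """Set of unique contiguous character bigrams in word."""
    return {word[i:i + 2] for i in range(len(word) - 1)}


def dice(a, b):
    """Dice coefficient: 2 * shared bigrams / total unique bigrams."""
    x, y = bigrams(a), bigrams(b)
    if not x and not y:
        return 0.0
    return 2 * len(x & y) / (len(x) + len(y))


def similarity(a, b):
    """Stage 2: compare the light stems of two words."""
    return dice(light_stem(a), light_stem(b))
```

Under these assumed affix lists, `similarity("الكتاب", "كتابها")` is 1.0, because both words reduce to the same light stem. By contrast, `dice("كتاب", "كاتب")` is 0.0 even though both words derive from the root k-t-b: the infixed alif breaks every contiguous bigram. This is the kind of case that motivates similarity measures sensitive to Arabic morphology.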