Software—Practice & Experience
Arabic morphology generation using a concatenative strategy
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Arabic morphological analysis techniques: a comprehensive survey
Journal of the American Society for Information Science and Technology
Arabic finite-state morphological analysis and generation
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Message Understanding Conference-6: a brief history
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Arabic GramCheck: a grammar checker for Arabic: Research Articles
Software—Practice & Experience
Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Arabic Computational Morphology: Knowledge-based and Empirical Methods
Arabic Computational Morphology: Knowledge-based and Empirical Methods
NERA: Named Entity Recognition for Arabic
Journal of the American Society for Information Science and Technology
Arabic named entity recognition using optimized feature sets
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Arabic dialect processing tutorial
NAACL-Tutorials '07 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Tutorial Abstracts
Morphological analysis and generation for Arabic dialects
Semitic '05 Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages
Issues in Arabic orthography and morphology analysis
Semitic '04 Proceedings of the Workshop on Computational Approaches to Arabic Script-based Languages
Climbing the path to grammar: a maximum entropy model of subject/object learning
PMHLA '05 Proceedings of the Workshop on Psychocomputational Models of Human Language Acquisition
Semitic '07 Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources
Developing a competitive HMM arabic POS tagger using small training corpora
ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Improving arabic part-of-speech tagging through morphological analysis
ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Expert Systems with Applications: An International Journal
A real time Named Entity Recognition system for Arabic text mining
Language Resources and Evaluation
Off-line handwritten arabic word recognition using SVMs with normalized poly kernel
ICONIP'12 Proceedings of the 19th international conference on Neural Information Processing - Volume Part II
Time for More Languages: Temporal Tagging of Arabic, Italian, Spanish, and Vietnamese
ACM Transactions on Asian Language Information Processing (TALIP)
A hybrid approach to Arabic named entity recognition
Journal of Information Science
Hi-index | 0.00 |
The Arabic language presents researchers and developers of natural language processing (NLP) applications for Arabic text and speech with serious challenges. The purpose of this article is to describe some of these challenges and to present some solutions that would guide current and future practitioners in the field of Arabic natural language processing (ANLP). We begin with general features of the Arabic language in Sections 1, 2, and 3 and then we move to more specific properties of the language in the rest of the article. In Section 1 of this article we highlight the significance of the Arabic language today and describe its general properties. Section 2 presents the feature of Arabic Diglossia showing how the sociolinguistic aspects of the Arabic language differ from other languages. The stability of Arabic Diglossia and its implications for ANLP applications are discussed and ways to deal with this problematic property are proposed. Section 3 deals with the properties of the Arabic script and the explosion of ambiguity that results from the absence of short vowel representations and overt case markers in contemporary Arabic texts. We present in Section 4 specific features of the Arabic language such as the nonconcatenative property of Arabic morphology, Arabic as an agglutinative language, Arabic as a pro-drop language, and the challenge these properties pose to ANLP. We also present solutions that have already been adopted by some pioneering researchers in the field. In Section 5 we point out to the lack of formal and explicit grammars of Modern Standard Arabic which impedes the progress of more advanced ANLP systems. In Section 6 we draw our conclusion.