Communications of the ACM
Categorizing and standardizing proper nouns for efficient information retrieval
Corpus processing for lexical acquisition
An arabic lexicon to support information retrieval, parsing, and text generation
An arabic lexicon to support information retrieval, parsing, and text generation
Coping with ambiguity and unknown words through probabilistic models
Computational Linguistics - Special issue on using large corpora: II
Automatic rule induction for unknown-word guessing
Computational Linguistics
A simple rule-based part of speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Disambiguation of proper names in text
ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Automatic processing of proper names in texts
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Morphological analysis and synthesis by automated discovery and acquisition of linguistic rules
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2
Acquisition system for Arabic noun morphology
SEMITIC '02 Proceedings of the ACL-02 workshop on Computational approaches to semitic languages
QARAB: a question answering system to support the Arabic language
SEMITIC '02 Proceedings of the ACL-02 workshop on Computational approaches to semitic languages
Hi-index | 0.00 |
In this paper we describe a system for building an Arabic lexicon automatically by tagging Arabic newspaper text. In this system we are using several techniques for tagging the words in the text and figuring out their types and their features. The major techniques that we are using are: finding phrases, analyzing the affixes of the words, and analyzing their patterns. Proper nouns are particularly difficult to identify in the Arabic language; we describe techniques for isolating them.