Discovering lexical information by tagging Arabic newspaper text

Authors:
Saleem Abuleil;Martha Evens
Affiliations:
Illinois Institute of Technology, Chicago, IL;Illinois Institute of Technology, Chicago, IL
Venue:
Semitic '98 Proceedings of the Workshop on Computational Approaches to Semitic Languages
Year:
1998

Citing 9
Cited 2

Information extraction

Communications of the ACM
Categorizing and standardizing proper nouns for efficient information retrieval

Corpus processing for lexical acquisition
An arabic lexicon to support information retrieval, parsing, and text generation

An arabic lexicon to support information retrieval, parsing, and text generation
Coping with ambiguity and unknown words through probabilistic models

Computational Linguistics - Special issue on using large corpora: II
Automatic rule induction for unknown-word guessing

Computational Linguistics
A simple rule-based part of speech tagger

ANLC '92 Proceedings of the third conference on Applied natural language processing
Disambiguation of proper names in text

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Automatic processing of proper names in texts

EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Morphological analysis and synthesis by automated discovery and acquisition of linguistic rules

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2

Acquisition system for Arabic noun morphology

SEMITIC '02 Proceedings of the ACL-02 workshop on Computational approaches to semitic languages
QARAB: a question answering system to support the Arabic language

SEMITIC '02 Proceedings of the ACL-02 workshop on Computational approaches to semitic languages

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we describe a system for building an Arabic lexicon automatically by tagging Arabic newspaper text. In this system we are using several techniques for tagging the words in the text and figuring out their types and their features. The major techniques that we are using are: finding phrases, analyzing the affixes of the words, and analyzing their patterns. Proper nouns are particularly difficult to identify in the Arabic language; we describe techniques for isolating them.