Hybrid system for extracting and classifying Arabic proper names

Authors:
Saleem Abuleil
Affiliations:
Information Systems Department, Chicago State University, Chicago, IL
Venue:
AIKED'06 Proceedings of the 5th WSEAS International Conference on Artificial Intelligence, Knowledge Engineering and Data Bases
Year:
2006

Citing 5
Cited 0

Arabic language parser

International Journal of Man-Machine Studies
An Algorithm that Learns What‘s in a Name

Machine Learning - Special issue on natural language learning
Disambiguation of proper names in text

ANLC '97 Proceedings of the fifth conference on Applied natural language processing
Unsupervised learning of generalized names

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A WordNet-based approach to Named Entities recognition

SEMANET '02 Proceedings of the 2002 workshop on Building and using semantic networks - Volume 11

Quantified Score

Hi-index	0.00

Visualization

Abstract

Many applications such as information extraction systems, question answering systems, text summarization systems, information retrieval systems, etc. rely on proper names as one main tool to achieve their goals. In the Arabic language there is a big challenge for finding those proper names in the text because they do not start with capital letter as in many other languages, nor they have special sign to identify them and distinguish them from other words in the text. Little research has been conducted in this area; most efforts have been done based on a number of heuristic rules used to find names in the text, some used graphs to represent the words that might form a name and the relationships between them, some they use statistical methods for this reason. In this paper we describe a hybrid system built based on both statistical methods and predefined rules.