Arabic is the most widely spoken language in the Arab World. Most people of the Islamic World understand Classical Arabic because it is the language of the Qur'an. Although the number of Arabic-speaking Internet users (in the Middle East and in North and East Africa) has increased considerably over the last decade, systems for automatically analyzing Arabic digital resources are not as readily available as they are for English. This work therefore attempts to build a real-time Named Entity Recognition system that can be used in web applications to detect specific named entities and events in news written in Arabic. Because Arabic is a highly inflectional language, we try to minimize the impact of Arabic affixes on the quality of the pattern recognition model applied to identify named entities. These patterns are built by processing and integrating different gazetteers, from DBPedia ( http://dbpedia.org/About , 2009) to GATE (A general architecture for text engineering, 2009) and ANERGazet ( http://users.dsic.upv.es/grupos/nle/?file=kop4.php ).
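As a rough illustration of the idea of combining gazetteers while reducing the effect of affixes, the following minimal Python sketch merges several name lists and looks up each token both in its surface form and with one common attached prefix removed. It is not the system described above: the prefix list, the function names (`merge_gazetteers`, `candidates`, `tag_tokens`), the labels, and the toy gazetteer entries are all assumptions made for the example, and the real DBPedia, GATE, and ANERGazet resources are not loaded here.

```python
# Hypothetical list of attached prefixes (conjunctions, prepositions,
# definite article). A real system would need a fuller inventory.
PREFIXES = ("وال", "بال", "لل", "ال", "و", "ب", "ل")


def merge_gazetteers(*sources):
    """Combine several {name: label} dictionaries; earlier sources win on conflicts."""
    merged = {}
    for source in sources:
        for name, label in source.items():
            merged.setdefault(name, label)
    return merged


def candidates(token, min_stem=2):
    """Yield the surface form, then the token with each matching prefix removed."""
    yield token
    for prefix in PREFIXES:
        if token.startswith(prefix) and len(token) - len(prefix) >= min_stem:
            yield token[len(prefix):]


def tag_tokens(tokens, gazetteer):
    """Label each token with the first gazetteer hit among its candidates, else "O"."""
    tagged = []
    for token in tokens:
        label = next(
            (gazetteer[c] for c in candidates(token) if c in gazetteer), "O"
        )
        tagged.append((token, label))
    return tagged


# Toy gazetteers standing in for the real resources (assumed entries).
toy_locations = {"القاهرة": "LOC", "لبنان": "LOC"}
toy_persons = {"درويش": "PER"}
gazetteer = merge_gazetteers(toy_locations, toy_persons)

# "The delegation returned from Cairo and Lebanon"
sentence = "عاد الوفد من القاهرة ولبنان"
result = tag_tokens(sentence.split(), gazetteer)
for token, label in result:
    print(token, label)
```

Trying the surface form first keeps names that legitimately begin with a prefix-like letter sequence (such as the definite article in "القاهرة") from being over-stripped, while the conjunction attached to "ولبنان" is still removed before lookup. Multi-word names and suffixes are left out of this sketch.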