Foundations of statistical natural language processing
Foundations of statistical natural language processing
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Shallow parsing with conditional random fields
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Language model based arabic word segmentation
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Entity extraction without language-specific resources
COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Named entity recognition using hundreds of thousands of features
CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
CONLL '03 Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003 - Volume 4
Multilingual named entity extraction and translation from text and speech
Multilingual named entity extraction and translation from text and speech
ANERsys: An Arabic Named Entity Recognition System Based on Maximum Entropy
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Arabic named entity recognition using optimized feature sets
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Person name entity recognition for Arabic
Semitic '07 Proceedings of the 2007 Workshop on Computational Approaches to Semitic Languages: Common Issues and Resources
Nonparametric Bayesian machine transliteration with synchronous adaptor grammars
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Integrating rule-based system with classification for arabic named entity recognition
CICLing'12 Proceedings of the 13th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Recall-oriented learning of named entities in Arabic Wikipedia
EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
A hybrid approach to Arabic named entity recognition
Journal of Information Science
Crime profiling for the Arabic language using computational linguistic techniques
Information Processing and Management: an International Journal
Hi-index | 0.00 |
This paper introduces simplified yet effective features that can robustly identify named entities in Arabic text without the need for morphological or syntactic analysis or gazetteers. A CRF sequence labeling model is trained on features that primarily use character n-gram of leading and trailing letters in words and word n-grams. The proposed features help overcome some of the morphological and orthgraphic complexities of Arabic. In comparing to results in the literature using Arabic specific features such POS tags on the same dataset and same CRF implementation, the results in this paper are lower by 2 F-measure points for locations, but are better by 8 points for organizations and 9 points for persons.