Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
Improved statistical alignment models
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Arabic Natural Language Processing
Arabic Natural Language Processing
Segmentation for English-to-Arabic statistical machine translation
HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Arabic preprocessing schemes for statistical machine translation
NAACL-Short '06 Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Short Papers
Hi-index | 0.00 |
In this paper we show how to achieve a significant increase in Bleu score in case of English to Arabic Statistical Machine Translation (SMT) by making some preprocessing for both English and Arabic and also using Morphological splitting of Arabic. The preprocessing involves numbers, dates and person names clustering. The morphological splitting uses Columbia University Arabic language analysis tool (MADA) and the SMT is using MOSES and GIZA++ tools.