Arabic script language identifications using adaptive neural network

  • Authors:
  • Ali Selamat;Ng Choon-Ching

  • Affiliations:
  • Universiti Teknologi Malaysia;Universiti Teknologi Malaysia

  • Venue:
  • ACST '08 Proceedings of the Fourth IASTED International Conference on Advances in Computer Science and Technology
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Globalization has led to unlimited information between geographically remote locations and insight of a global common market. When constructing website applications for use on various industries, developers need to deal with a wide range of users from different countries. Thus, multilingual system is implemented in order to make available multilingual environment in those applications. However, it is time-consuming to define all the possible languages for multilingual system manually, it would be desirable to automate the adoption of language identification for text-based documents. To address this need, we introduce language identification of Arabic script documents with letter frequency based. Techniques used for identification are fuzzy ARTMAP and default ARTMAP, which are belong to neural network architectures that perform incremental supervised learning. Arabic script documents such as Arabic, Persian and Urdu were used for performing language identification. From the experiments, we have found that fuzzy ARTMAP has performed better than the default ARTMAP in Arabic script language identification.