Classifying Amharic news text using self-organizing maps

  • Authors:
  • Samuel Eyassu;Björn Gambäck

  • Affiliations:
  • Addis Ababa University, Ethiopia;Swedish Institute of Computer Science, Kista, Sweden

  • Venue:
  • Semitic '05 Proceedings of the ACL Workshop on Computational Approaches to Semitic Languages
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper addresses using artificial neural networks for classification of Amharic news items. Amharic is the language for countrywide communication in Ethiopia and has its own writing system containing extensive systematic redundancy. It is quite dialectally diversified and probably representative of the languages of a continent that so far has received little attention within the language processing field. The experiments investigated document clustering around user queries using Self-Organizing Maps, an unsupervised learning neural network strategy. The best ANN model showed a precision of 60.0% when trying to cluster unseen data, and a 69.5% precision when trying to classify it.