Rational kernels for arabic text classification

  • Authors:
  • Attia Nehar;Djelloul Ziadi;Hadda Cherroun

  • Affiliations:
  • Laboratoire d'Informatique et Mathématiques, Université Amar Télidji Laghouat, Algérie;Laboratoire LITIS - EA 4108, Normandie Université, Rouen, France;Laboratoire d'Informatique et Mathématiques, Université Amar Télidji Laghouat, Algérie

  • Venue:
  • SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many stemming techniques are used in the context of Arabic Text Classification. In this paper, we show the effect of stemming on classification systems. We introduce a new stemming technique -approximate stemming- based on the use of Arabic patterns. These patterns are modeled using transducers and stemming is done without depending on any dictionary. Using transducers for stemming words, documents are transformed into finite state transducers. This allow us to use rational kernels as a framework for Arabic Text Classification. Experiments show that, when compared with other approaches, our approach is more effective specially in term of Accuracy, Recall and F1.