A semantic proximity based system of Arabic text indexation

  • Authors:
  • Taher Zaki;Driss Mammass;Abdellatif Ennaji

  • Affiliations:
  • Ibn Zohr University, Agadir, Morrocco;Ibn Zohr University, Agadir, Morrocco;LITIS, University of Rouen, France

  • Venue:
  • ICISP'10 Proceedings of the 4th international conference on Image and signal processing
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we extended the vectorial model of Salton [9], [11], [12] and [14], by adapting the TF-IDF parameter by its combination with the Okapi formula for index terms extraction and evaluation of the in order to identify the relevant concepts which represent a document. Indeed, we have proposed a new measure TFIDF-ABR which takes in consideration the concept of semantic vicinity using a measure of similarity between terms by combining the calculation of TF-IDF-Okapi with a kernel approach (Radial Basis function). This indexation approach allows a contextual and semantic research. In order to have a robust descriptor index, we used not only a semantic graph to highlight the semantic connections between terms, but also an auxiliary dictionary to increase the connectivity of the constructed graph and therefore the semantic weight of indexation words.