Multilingual Indexing Based on Ontologies

  • Authors:
  • Catherine Roussey;Sylvie Calabretto;Farah Harrathi;Mohamed Gammoudi

  • Affiliations:
  • LIRIS CNRS UMR 5205-Université Lyon 1, Bâtiment Nautibus 8, boulevard Niels Bohr F-69622 Villeurbanne Cedex, {firstname.lastname}@liris.cnrs.fr;LIRIS CNRS UMR 5205-INSA de Lyon, Bâtiment Blaise Pascal 7, avenue Jean Capelle, F-69621 Villeurbanne Cedex, {firstname.lastname}@insa-lyon.fr;LIRIS CNRS UMR 5205-INSA de Lyon, Bâtiment Blaise Pascal 7, avenue Jean Capelle, F-69621 Villeurbanne Cedex, {firstname.lastname}@insa-lyon.fr;Unité de recherche en Algorithmique, Programmation et Heuristique ISIG Kairoun-Université de Kairoun (Tunisie), mohamed.gammoudi@fst.rnu.tn

  • Venue:
  • Proceedings of the 2006 conference on Leading the Web in Concurrent Engineering: Next Generation Concurrent Engineering
  • Year:
  • 2006

Abstract

This article addresses multilingual document indexing. We propose an indexing method that proceeds in several stages. First, the most important terms of the document are extracted using general characteristics of languages and statistical methods; the term extraction stages can therefore be applied to any document, whatever its language. Second, the method uses a multilingual ontology to find the most relevant concept representing the document content. The method can thus be applied to a multilingual corpus containing documents written in different languages. This indexing procedure is part of a multilingual document system called SyDoM, which manages XML documents.
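The two stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify which language characteristics or statistics are used, so the sketch assumes plain term frequency with a minimum word length as a crude, language-independent stand-in for stop-word filtering. The toy ontology, the concept identifiers, and the function names (`extract_terms`, `build_label_index`, `index_document`) are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical toy ontology: concept id -> labels per language.
# A real multilingual ontology would be far richer; this is only an illustration.
ONTOLOGY = {
    "C_ENGINE": {"en": ["engine", "motor"], "fr": ["moteur"]},
    "C_WHEEL": {"en": ["wheel"], "fr": ["roue"]},
}


def extract_terms(text, top_k=5, min_len=4):
    """Stage 1 (assumed): keep the top_k most frequent tokens.

    Short tokens are dropped as a rough, language-independent proxy for
    function words; no language-specific resource is used.
    """
    tokens = re.findall(r"\w+", text.lower())  # \w is Unicode-aware for str
    counts = Counter(t for t in tokens if len(t) >= min_len)
    return [term for term, _ in counts.most_common(top_k)]


def build_label_index(ontology):
    """Map every label, in every language, to the concepts it denotes."""
    index = {}
    for concept, labels_by_lang in ontology.items():
        for labels in labels_by_lang.values():
            for label in labels:
                index.setdefault(label.lower(), set()).add(concept)
    return index


def index_document(text, ontology, top_k=5):
    """Stage 2 (assumed): return the sorted concept ids matched by the terms."""
    label_index = build_label_index(ontology)
    concepts = set()
    for term in extract_terms(text, top_k=top_k):
        concepts |= label_index.get(term, set())
    return sorted(concepts)


french_doc = "Le moteur tourne. Le moteur entraîne la roue."
english_doc = "The engine drives the wheel; the engine is loud."
print(index_document(french_doc, ONTOLOGY))
print(index_document(english_doc, ONTOLOGY))
```

Under these assumptions, the French and English documents are indexed by the same concept identifiers, which is the property that lets a single index serve a corpus written in several languages.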