Conceptual indexing for multilingual information retrieval

  • Authors:
  • Jacques Guyot;Saïd Radhouani;Gilles Falquet

  • Affiliations:
  • Centre Universitaire d’Informatique, University of Geneva, Genève 4, Switzerland;Centre Universitaire d’Informatique, University of Geneva, Genève 4, Switzerland;Centre Universitaire d’Informatique, University of Geneva, Genève 4, Switzerland

  • Venue:
  • CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a translation-free technique for multilingual information retrieval. This technique is based on an ontological representation of documents and queries. For each language, we use a dictionary (set of lexical reference for concepts) to map a term to its corresponding concept. The same mapping is applied to each document and each query. Then, we use a classic vector space model based on concept for indexing and querying the document corpus. The main advantages of our approach are: no merging phase is required; no dependency on automatic translators between all pairs of languages; and adding a new language only requires a new mapping dictionary to be added into the multilingual ontology. Experimental results on the CLEF 2005 multi8 collection show that this approach is efficient, even with relatively small and low fidelity dictionaries and without word sense disambiguation.