Using Translation Heuristics to Improve a Multimodal and Multilingual Information Retrieval System

  • Authors:
  • Miguel Ángel García-Cumbreras;Maria Teresa Martín-Valdivia;Luis Alfonso Ureña-López;Manuel Carlos Díaz-Galiano;Arturo Montejo-Ráez

  • Affiliations:
  • University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain;University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain;University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain;University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain;University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain

  • Venue:
  • WILF '07 Proceedings of the 7th international workshop on Fuzzy Logic and Applications: Applications of Fuzzy Sets Theory
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Nowadays, the multimodal nature of the World Wide Web is an evidence. Web sites which include video files, pictures, music and text have become widespread. Furthermore, multimodal collections in several languages demand to apply multilingual information retrieval strategies. This paper describes a new retrieval technique applied on a multimodal and multilingual system that have been tested on two different multilingual image collections. The system applies several machine translators and implements some novel heuristics. These heuristics explore a variety of ways to combine the translations obtained from the given set of translators, and the configuration of the retrieval model by using different weighting functions, and also studying the effect of pseudo-relevance feedback(PRF) on this domain. Our results show interesting effects by these variations, allowing the determination of the parameters for the best retrieval model on this data and reporting the loss in performance on each language.